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Introduction 


The  emphasis  in  this  course  is  on  problems — doing  calculations  and  story  problems.  To  master  problem 
solving  one  needs  a  tremendous  amount  of  practice  doing  problems.  The  more  problems  you  do  the 
better  you  will  be  at  doing  them,  as  patterns  will  start  to  emerge  in  both  the  problems  and  in  successful 
approaches  to  them.  You  will  learn  quickly  and  effectively  if  you  devote  some  time  to  doing  problems 
every  day. 

Typically  the  most  difficult  problems  are  story  problems,  since  they  require  some  effort  before  you  can 
begin  calculating.  Here  are  some  pointers  for  doing  story  problems: 

1.  Carefully  read  each  problem  twice  before  writing  anything. 

2.  Assign  letters  to  quantities  that  are  described  only  in  words;  draw  a  diagram  if  appropriate. 

3.  Decide  which  letters  are  constants  and  which  are  variables.  A  letter  stands  for  a  constant  if  its  value 
remains  the  same  throughout  the  problem. 

4.  Using  mathematical  notation,  write  down  what  you  know  and  then  write  down  what  you  want  to 
find. 

5.  Decide  what  category  of  problem  it  is  (this  might  be  obvious  if  the  problem  comes  at  the  end  of  a 
particular  chapter,  but  will  not  necessarily  be  so  obvious  if  it  comes  on  an  exam  covering  several 
chapters). 

6.  Double  check  each  step  as  you  go  along;  don’t  wait  until  the  end  to  check  your  work. 

7.  Use  common  sense;  if  an  answer  is  out  of  the  range  of  practical  possibilities,  then  check  your  work 
to  see  where  you  went  wrong. 
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1.  Review 


Success  in  calculus  depends  on  your  background  in  algebra,  trigonometry,  analytic  geometry  and  func¬ 
tions.  In  this  chapter,  we  review  many  of  the  concepts  you  will  need  to  know  to  succeed  in  this  course. 


1.1  Algebra 


1.1.1.  Sets  and  Number  Systems 


A  set  can  be  thought  of  as  any  collection  of  distinct  objects  considered  as  a  whole.  Typically,  sets  are  rep¬ 
resented  using  set-builder  notation  and  are  surrounded  by  braces.  Recall  that  (,)  are  called  parentheses 
or  round  brackets;  [,]  are  called  square  brackets;  and  {, }  are  called  braces  or  curly  brackets. 


Example  1.1:  Sets 


The  collection  {a,b,  1,2}  is  a  set.  It  consists  of  the  collection  of  four  distinct  objects,  namely,  a,  b, 
1  and  2. 


Let  S  be  any  set.  We  use  the  notation  x  e  S  to  mean  that  x  is  an  element  inside  of  the  set  S,  and  the 
notation  x  ^  S  to  mean  that  x  is  not  an  element  of  the  set  S. 


The  intersection  between  two  sets  S  and  T  is  denoted  by  S  ft  T  and  is  the  collection  of  all  elements 
that  belong  to  both  S  and  T.  The  union  between  two  sets  S  and  T  is  denoted  by  S  U  T  and  is  the  collection 
of  all  elements  that  belong  to  either  S  or  T  (or  both). 


Numbers  can  be  classified  into  sets  called  number  systems. 
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N 

the  natural  numbers 

{1,2,3,...} 

Z 

the  integers 

{...,-3, -2, -1,0, 1,2,3,...} 

Q 

the  rational  numbers 

Ratios  of  integers:  j  |  :  p,q  6  Z,q  /  0  j 

M 

the  real  numbers 

Can  be  written  using  a  finite  or  infinite  decimal  expansion 

C 

the  complex  numbers 

These  allow  us  to  solve  equations  such  as  x2  +  1  =  0 

In  the  table,  the  set  of  rational  numbers  is  written  using  set-builder  notation.  The  colon,  :,  used  in  this 
manner  means  such  that.  Often  times,  a  vertical  bar  |  may  also  be  used  to  mean  such  that.  The  expression 

1 1  :  p,q  0  j  can  be  read  out  loud  as  the  set  of  all  fractions  p  over  q  such  that  p  and  q  are  both 

integers  and  q  is  not  equal  to  zero. 


Example  1.4:  Rational  Numbers 


The  numbers  —  |,  2.647,  17,  0.7  are  all  rational  numbers.  You  can  think  of  rational  numbers  as 
fractions  of  one  integer  over  another.  Note  that  2.641  can  be  written  as  a  fraction: 


2.647 


2.647  x 


1000 

Tooo 


2647 

1000' 


Also  note  that  in  the  expression  0.7,  the  bar  over  the  7  indicates  that  the  7  is  repeated  forever: 

7 

0.77777777 ...  =  -. 
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All  rational  numbers  are  real  numbers  with  the  property  that  their  decimal  expansion  either  terminates 
after  a  finite  number  of  digits  or  begins  to  repeat  the  same  finite  sequence  of  digits  over  and  over.  Real 
numbers  that  are  not  rational  are  called  irrational. 


Example  1.5:  Irrational  Numbers 


Some  of  the  most  common  irrational  numbers  include: 

•  s/2.  Can  you  prove  this  is  irrational?  (The  proof  uses  a  technique  called  contradiction .) 

•  K.  Recall  that  K  (pi)  is  defined  as  the  ratio  of  the  circumference  of  a  circle  to  its  diameter 
and  can  be  approximated  by  3.14159265. 

•  e.  Sometimes  called  Euler’s  number,  e  can  be  approximated  by  2.718281828459.  We  will 
review  the  definition  of  e  in  a  later  chapter. 


Let  S  and  T  be  two  sets.  If  every  element  of  S  is  also  an  element  of  T,  then  we  say  S  is  a  subset  of  T 
and  write  S  C  T .  Furthermore,  if  S  is  a  subset  of  T  but  not  equal  to  T,  we  often  write  S  C  T .  The  five  sets 
of  numbers  in  the  table  give  an  increasing  sequence  of  sets: 
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That  is,  all  natural  numbers  are  also  integers,  all  integers  are  also  rational  numbers,  all  rational  numbers 
are  also  real  numbers,  and  all  real  numbers  are  also  complex  numbers. 

1.1.2.  Law  of  Exponents 


The  Law  of  Exponents  is  a  set  of  rules  for  simplifying  expressions  that  governs  the  combination  of  expo¬ 
nents  (powers).  Recall  that  nj~  denotes  the  nth  root.  For  example  \/8  =  2  represents  that  the  cube  root  of 
8  is  equal  to  2. 


In  the  next  example,  the  word  simplify  means  to  make  simpler  or  to  write  the  expression  more  com¬ 
pactly. 


Example  1.7:  Laws  of  Exponents 


Simplify  the  following  expression  as  much  as  possible  assuming  x,y  >  0: 
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Solution.  Using  the  Law  of  Exponents,  we  have: 


3x  2y3x  .  i 

- — ,  since  -Jx—x2, 

yAx  2 

3x  2yx  .  v3 

- y — ,  since  ~2  =  y, 

x2  y 

3v  .  x~2x  X-1  _  3  1 

-3-,  since  — —  =  — j-  —x  2  —  — , 

X2  X2  X2  X2 

3  y  .  3  33 

— =,  since  x2  —  VxL 

An  answer  of  3yx~  3/2  is  equally  acceptable,  and  such  an  expression  may  prove  to  be  computationally 
simpler,  although  a  positive  exponent  may  be  preferred.  X 


1.1.3.  The  Quadratic  Formula  and  Completing  the  Square 


3x~2y3x 

y2V* 


The  technique  of  completing  the  square  allows  us  to  solve  quadratic  equations  and  also  to  determine  the 
center  of  a  circle/ellipse  or  the  vertex  of  a  parabola. 

The  main  idea  behind  completing  the  square  is  to  turn: 

'j 

ax~  +  bx  +  c 


into 


a(x  —  h)2  +k. 

One  way  to  complete  the  square  is  to  use  the  following  formula: 

•7  (  b\2  b2 

ax  +bx  +  c  =  a\  x+  —  \  — +  c. 

V  2  a)  4  a2 

But  this  formula  is  a  bit  complicated,  so  some  students  prefer  following  the  steps  outlined  in  the  next 
example. 


Example  1.8:  Completing  the  Square 


Solve  lx2  +  12x  —  32  =  0  by  completing  the  square. 


Solution.  In  this  instance,  we  will  not  divide  by  2  first  (usually  you  would)  in  order  to  demonstrate  what 
you  should  do  when  the  ‘a’  value  is  not  1. 
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2x2  +  12x  —  32  =  0  Start  with  original  equation. 

2x2  +  12x  =  32  Move  the  number  over  to  the  other  side. 

2(x2  +  6x)  =  32  Factor  out  the  a  from  the  ax 2  +  bx  expression. 

6  — »  |  =  3  — »  32  =  9  Take  the  number  in  front  of  x, 

divide  by  2, 
then  square  it. 

2(x2  +  6x  +  9)  =32  +  2-9  Add  the  result  to  both  sides, 

taking  a  =  2  into  account. 

2(x  +  3)2  =  50  Factor  the  resulting  perfect  square  trinomial. 

You  have  now  completed  the  square! 

(x  +  3)2  =  25  — *  x  =  2  or  x  =  —8  To  solve  for  x,  simply  divide  by  a  —  2 

and  take  square  roots. 


Suppose  we  want  to  solve  for  x  in  the  quadratic  equation  ax2 +  bx  +  c  —  0,  where  a  ^  0.  The  solution(s) 
to  this  equation  are  given  by  the  quadratic  formula. 


The  Quadratic  Formula 

The  solutions  to  ax2  +  bx  +  c  =  0  (with  a  /  0)  are  x  = 

— b±  sjb2  —  4 ac 

2  a 

Proof.  To  prove  the  quadratic  formula  we  use  the  technique  of  completing  the  square.  The  general  tech¬ 
nique  involves  taking  an  expression  of  the  form  x2  +  rx  and  trying  to  find  a  number  we  can  add  so  that  we 
end  up  with  a  perfect  square  (that  is,  (x  +  n)2).  It  turns  out  if  you  add  (r/2)2  then  you  can  factor  it  as  a 
perfect  square. 

For  example,  suppose  we  want  to  solve  for  x  in  the  equation  ax2  +  bx  +  c  —  0,  where  a  ^  0.  Then  we 
can  move  c  to  the  other  side  and  divide  by  a  (remember,  a  ^  0  so  we  can  divide  by  it)  to  get 

9  b  c 

x  +  -x  =  — . 
a  a 

To  write  the  left  side  as  a  perfect  square  we  use  what  was  mentioned  previously.  We  have  r  =  (b/a)  in  this 
case,  so  we  must  add  {r/2)2  —  {b/2a)2  to  both  sides 

^+(r)2=--+(r)2- 

a  \2a )  a  \2a  / 

We  know  that  the  left  side  can  be  factored  as  a  perfect  square 

+  AY  =  -£  +  (1)2, 

2a )  a  \2  a  J 
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The  right  side  simplifies  by  using  the  exponent  rules  and  finding  a  common  denominator 


-4  ac  +  b2 
4a 2 


Taking  the  square  root  we  get 

b  I —4ac  +  b2 

2a  ~  ~v  4a2  ’ 

which  can  be  rearranged  as 

— b  ±  \Jb2  —  4 ac 


In  essence,  the  quadratic  formula  is  just  completing  the  square. 


1.1.4.  Inequalities,  Intervals  and  Solving  Basic  Inequalities 


X 


Inequality  Notation 

Recall  that  we  use  the  symbols  <,>,<,>  when  writing  an  inequality.  In  particular, 

•  a  <  b  means  a  is  to  the  left  of  b  (that  is,  a  is  strictly  less  than  b), 

•  a  <b  means  a  is  to  the  left  of  or  the  same  as  b  (that  is,  a  is  less  than  or  equal  to  b), 

•  a>  b  means  a  is  to  the  right  of  b  (that  is,  a  is  strictly  greater  than  b), 

•  a>b  means  a  is  to  the  right  of  or  the  same  as  b  (that  is,  a  is  greater  than  or  equal  to  b). 

To  keep  track  of  the  difference  between  the  symbols,  some  students  use  the  following  mnemonic. 


Mnemonic 


The  <  symbol  looks  like  a  slanted  L  which  stands  for  “Less  than”. 


The  real  numbers  are  ordered  and  are  often  illustrated  using  the  real  number  line: 


y/2 


it 


-4 


-3 


-2 


-1 


-2.5 
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Intervals 


Assume  a,b  are  real  numbers  with  a  <  b  (i.e.,  a  is  strictly  less  than  b).  An  interval  is  a  set  of  every  real 
number  between  two  indicated  numbers  and  may  or  may  not  contain  the  two  numbers  themselves.  When 
describing  intervals  we  use  both  round  brackets  and  square  brackets. 

(1)  Use  of  round  brackets  in  intervals:  ( ,  ).  The  notation  (a,b)  is  what  we  call  the  open  interval  from 
a  to  b  and  consists  of  all  the  numbers  between  a  and  b ,  but  does  not  include  a  or  b.  Using  set-builder 
notation  we  write  this  as: 

(a,b)  —  {x  G  M  :  a  <  x  <  b}. 

We  read  {x  6  R  :  a  <  x  <  b}  as  “the  set  of  real  numbers  x  such  that  x  is  greater  than  a  and  less  than  b ”  On 
the  real  number  line  we  represent  this  with  the  following  diagram: 

- o  o - 

a  b 

Note  that  the  circles  on  a  and  b  are  not  shaded  in,  we  call  these  open  circles  and  use  them  to  denote  that 
a,  b  are  omitted  from  the  set. 

(2)  Use  of  square  brackets  in  intervals:  [  ,  ].  The  notation  [a,b]  is  what  we  call  the  closed  interval 
from  a  to  b  and  consists  of  all  the  numbers  between  a  and  b  and  including  a  and  b.  Using  set-builder 
notation  we  write  this  as 

[a,b\  —  {x  G  M  |  a  <  x  <  b}. 

On  the  real  number  line  we  represent  this  with  the  following  diagram: 

a  b 

Note  that  the  circles  on  a  and  b  are  shaded  in,  we  call  these  closed  circles  and  use  them  to  denote  that  a 
and  b  are  included  in  the  set. 

To  keep  track  of  when  to  shade  a  circle  in,  you  may  find  the  following  mnemonic  useful: 


Mnemonic 


The  round  brackets  (, )  and  non-shaded  circle  both  form  an  “O”  shape  which  stands  for  “Open  and 
Omit”. 


Taking  combinations  of  round  and  square  brackets,  we  can  write  different  possible  types  of  intervals 
(we  assume  a  <  b): 


(a,b)  —  {x  G  R  :  a  <  x  <  b} 

[a,b\  —  {x  G  R  :  a  <  x  <  b} 

[a,  b)  =  {x  6  t  :  a  <  x  <  b} 

a  b 

a  b 

a  b 

{a,b\  ={x6t  :  a  <  x  <  b} 

(a,“)  =  {x6l  :  x  >  a] 

[o,“)  =  {x  6  1  :  x  >  a} 

a  b 

a 

a 

(—  o°,b)  —  {x  G  R  :  x  <  b} 

(—  o°,b\  —  {x  G  ®  :  x  <  b} 

(—  oo?oo )  =  M  =  all  real  numbers 

b 

b 
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Note:  Any  set  which  is  bound  at  positive  and/or  negative  infinity  is  an  open  interval. 

Inequality  Rules 


Before  solving  inequalities,  we  start  with  the  properties  and  rules  of  inequalities. 


Inequality  Rules 


Add/subtract  a  number  to  both  sides: 

•  If  a  <  b,  then  a  +  c  <  b  +  c  and  a  —  c  <  b  —  c. 

Adding  two  inequalities  of  the  same  type: 

•  If  a  <C  b  and  c  <t  d,  then  a  -\ -  c  <C  b  -\-  d. 

Add  the  left  sides  together,  add  the  right  sides  together. 

Multiplying  by  a  positive  number: 

•  Let  c  >  0.  If  a  <  b,  then  c-a  <  c-b. 

Multiplying  by  a  negative  number: 

•  Let  c  <  0.  If  a  <  b,  then  c-  a>  c-b. 

Note  that  we  reversed  the  inequality  symbol! 


Similar  rules  hold  for  each  of  <,  >  and  >. 

Solving  Basic  Inequalities 


We  can  use  the  inequality  rules  to  solve  some  simple  inequalities. 


Example  1.10:  Basic  Inequality 


Find  all  values  ofx  satisfying 

3x+  1  >  2x  —  3. 

Write  your  answer  in  both  interval  and  set-builder  notation.  Finally,  draw  a  number  line  indicating 
your  solution  set. 


Solution.  Subtracting  2x  from  both  sides  gives  x  +  1  >  —3.  Subtracting  1  from  both  sides  gives  x  >  —4. 
Therefore,  the  solution  is  the  interval  (—4,°°).  In  set-builder  notation  the  solution  may  be  written  as 
{x  €  R  :  x  >  —4}.  We  illustrate  the  solution  on  the  number  line  as  follows: 


-o 

-4 


X 
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Sometimes  we  need  to  split  our  inequality  into  two  cases  as  the  next  example  demonstrates. 


Solution.  We  need  both  4  >  3x  —  2  and  3x  —  2  >  2x  —  1  to  be  true: 


4  >  3x  —  2 

and 

3x  —  2  >  2x  —  1 , 

6  >  3x 

and 

x  —  2  >  — 1, 

2  >  x 

and 

x  >  1, 

x  <  2 

and 

x  >  1. 

Thus,  we  require  x  >  1  but  also  x  <  2  to  be  true.  This  gives  all  the  numbers  between  1  and  2,  including 
1  but  not  including  2.  That  is,  the  solution  to  the  inequality  4  >  3x  —  2  >  2x—  1  is  the  interval  [1,2).  In 
set-builder  notation  this  is  the  set  {x  G  M  :  1  <  x  <  2}.  X 


Solution.  We  provide  two  methods  to  solve  this  inequality. 

First  method.  Factor  4x  —  x2  as  x(4  —  x) .  The  product  of  two  numbers  is  positive  when  either  both  are 
positive  or  both  are  negative,  i.e.,  if  either  x  >  0  and  4  —  x  >  0,  or  else  x  <  0  and  4  —  x  <  0.  The  latter 
alternative  is  impossible,  since  if  x  is  negative,  then  4  —  x  is  greater  than  4,  and  so  cannot  be  negative.  As 
for  the  first  alternative,  the  condition  4  —  x  >  0  can  be  rewritten  (adding  x  to  both  sides)  as  4  >  x,  so  we 
need:  x  >  0  and  4  >  x  (this  is  sometimes  combined  in  the  form  4  >  x  >  0,  or,  equivalently,  0  <  x  <  4).  In 
interval  notation,  this  says  that  the  solution  is  the  interval  (0,4). 

Second  method.  Write  4x  —  x2  as  —  (x2  —  4x),  and  then  complete  the  square,  obtaining 

-((x-2)2  — 4)  =4-(x  — 2)2. 

For  this  to  be  positive  we  need  (x  —  2) 2  <  4,  which  means  that  x  —  2  must  be  less  than  2  and  greater  than 
—2:  —2  <  x  —  2  <  2.  Adding  2  to  everything  gives  0  <  x  <  4. 

Both  of  these  methods  are  equally  correct;  you  may  use  either  in  a  problem  of  this  type.  X 

We  next  present  another  method  to  solve  more  complicated  looking  inequalities.  In  the  next  example 
we  will  solve  a  rational  inequality  by  using  a  number  line  and  test  points.  We  follow  the  guidelines  below. 
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Guidelines  for  Solving  Rational  Inequalities 


1.  Move  everything  to  one  side  to  get  a  0  on  the  other  side. 

2.  If  needed,  combine  terms  using  a  common  denominator. 

3.  Factor  the  numerator  and  denominator. 

4.  Identify  points  where  either  the  numerator  or  denominator  is  0.  Such  points  are  called  split 
points. 

5.  Draw  a  number  line  and  indicate  your  split  points  on  the  number  line.  Draw  closed/open 
circles  for  each  split  point  depending  on  if  that  split  point  satisfies  the  inequality  (division  by 
zero  is  not  allowed). 

6.  The  split  points  will  split  the  number  line  into  subintervals.  For  each  subinterval  pick  a  test 
point  and  see  if  the  expression  in  Step  3  is  positive  or  negative.  Indicate  this  with  a  +  or  — 
symbol  on  the  number  line  for  that  subinterval. 

7.  Now  write  your  answer  in  set-builder  notation.  Use  the  union  symbol  U  if  you  have  multiple 
intervals  in  your  solution. 


Example  1.13:  Rational  Inequality 


Write  the  solution  to  the  following  inequality  using  interval  notation: 


2  —  x 
2  +  x 


>  1. 


Solution  One  method  to  solve  this  inequality  is  to  multiply  both  sides  by  2+x,  but  because  we  do  not 
know  if  2  +  x  is  positive  or  negative  we  must  split  it  into  two  cases  (Case  1:  2  +  x  >  0  and  Case  2: 
2+x  <  0). 

Instead  we  follow  the  guidelines  for  solving  rational  inequalities: 

2  —  x 


Start  with  original  problem: 
Move  everything  to  one  side: 

Find  a  common  denominator: 

Combine  fractions: 

Expand  numerator: 
Simplify  numerator: 


2  +  x 
2  —  x 


>  1 


1  >0 


2  +  x 
2— x  2+x 


>0 


2+x  2+x 

(2  — x)  -  (2+x) 
2  +  x 

2-x-2~x>0 

2+x 

— 2x  ^  ,  . 

>0  * 


>0 


2  +  x 
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Now  we  have  the  numerator  and  denominator  in  fully  factored  form.  The  split  points  are  x  —  0  (makes 
the  numerator  0)  and  x  —  —2  (makes  the  denominator  0).  Let  us  draw  a  number  line  with  the  split  points 
indicated  on  it: 

- 1 - 1 - 

-2  0 

The  point  x  —  0  is  included  since  if  we  sub  x  =  0  into  (*)  we  get  0  >  0  which  is  true.  The  point  x  —  —2  is 
not  included  since  we  cannot  divide  by  zero.  We  indicate  this  with  open/closed  circles  on  the  number  line 
(remember  that  open  means  omit): 


- o - • - 

-2  0 

Now  choosing  a  test  point  from  each  of  the  three  subintervals  we  can  determine  if  the  expression  is 
positive  or  negative.  When  x  —  —3,  it  is  negative.  When  jc  =  —  1 ,  it  is  positive.  When  x—  1,  it  is  negative. 
Indicating  this  on  the  number  line  gives: 


- o - * - 

-2  0 

Since  we  wish  to  solve  >  0,  we  look  at  where  the  +  signs  are  and  shade  that  area  on  the  number  line: 

- ± - - 

-2  0 

Since  there  is  a  closed  circle  at  0,  we  include  it.  Therefore,  the  solution  is  (—2,0].  X 


Solution.  We  provide  a  brief  outline  of  the  solution.  By  subtracting  (3v  +  3)  from  both  sides  and  using  a 
common  denominator  of  x  +  2,  we  can  collect  like  terms  and  simplify  to  get: 

—  (3x2  +  9.t  +  4)  ; 
x  -T  2 

The  denominator  is  zero  when  x  =  —2.  Using  the  quadratic  formula,  the  numerator  is  zero  when  x  = 
-9±\/33  (thggg  two  numbers  are  approximately  —2.46  and  —0.54).  Since  the  inequality  uses  “>”  and 
0  >  0  is  false,  we  do  not  include  any  of  the  split  points  in  our  solution.  After  choosing  suitable  test  points 
and  determining  the  sign  of  ~  ^3~\.+2A+4^  we  have 


+ 


— o — 
-2.46 


-o* 

-2 


+ 


— o— 
-0.54 
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Looking  where  the  +  symbols  are  located  gives  the  solution: 


-9 -a/33 


—9  + a/33  ' 


When  writing  the  final  answer  we  use  exact  expressions  for  numbers  in  mathematics,  not  approximations 
(unless  stated  otherwise).  & 


1.1.5.  The  Absolute  Value 


The  absolute  value  of  a  number  x  is  written  as  |x|  and  represents  the  distance  x  is  from  zero.  Mathemati¬ 
cally,  we  define  it  as  follows: 


x,  if  x  >  0, 
— x,  if  x  <  0. 


Thus,  if  x  is  a  negative  real  number,  then  —  x  is  a  positive  real  number.  The  absolute  value  does  not  just 
turn  minuses  into  pluses.  That  is,  \2x—  1 1  ^  2x+  1.  You  should  be  familiar  with  the  following  properties. 


r  ^ 

Absolute  Value  Properties 

1. 

* 

IV 

p 

2. 

M  =  Wbl 

3. 

l/x  =  1/  x  when  x  ^  0. 

4. 

|  —  x  =  x  . 

5. 

x  +  y  <  x 

+  y  .  This  is  called 

the  triangle  inequality. 

6. 

a/?  =  x  . 

When  solving  inequalities  with  absolute  values,  the  following  are  helpful. 
Case  1:  a  >  0. 

•  \x\—  a  has  solutions  x  —  ±a. 

•  |x|  <  a  means  x>  —a  and  x  <  a  (that  is,  —a  <  x  <  a). 

•  \x\  <  a  means  x  <  —a  and  x  <  a  (that  is,  —a<x<  a). 

•  \x\>  a  means  x  <  —a  or  x  >  a. 
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•  \x\  >  a  means  x  <  —a  or  x  >  a. 

Case  2:  a  <  0. 

•  \x\—  a  has  no  solutions. 

•  Both  |x|  <  a  and  |x|  <  a  have  no  solutions. 

•  Both  |x|  >  a  and  |x|  >  a  have  solution  set  {x|x  G  M}. 

Case  3:  a  —  0. 

•  |x|  =  0  has  solution  x  =  0. 

•  |x|  <  0  has  no  solutions. 

•  |x|  <  0  has  solution  x  =  0. 

•  |x|  >  0  has  solution  set  {x  G  R|x  ^  0}. 

•  |x|  >  0  has  solution  set  {x|x  G  M}. 

1.1.6.  Solving  Inequalities  that  Contain  Absolute  Values 


We  start  by  solving  an  equality  that  contains  an  absolute  value.  To  do  so,  we  recall  that  if  a  >  0  then  the 
solution  to  |x|  —  a  is  x  =  ±a.  In  cases  where  we  are  not  sure  if  the  right  side  is  positive  or  negative,  we 
must  perform  a  check  at  the  end. 


Solution.  This  means  that  either: 


2x  -|-  3  —  T  (2  —  x) 
2x  +  3  =  2  —  x 
3x  =  —  1 
x  =  —1/3 


or  2x  +  3  —  —(2—x) 
or  2x+  3  =  —  2  +x 
or  x  —  —5 

or  x  —  — 5 


Since  we  do  not  know  if  the  right  side  “a  —  2—x"  is  positive  or  negative,  we  must  perform  a  check  of  our 
answers  and  omit  any  that  are  incorrect. 

Ifx=  -1/3,  then  we  have  LS  =  |2(— 1/3)  +  3|  =  |-2/3  +  3|  =  |7/3|  =7/3  andtfS  =  2-  (-1/3)  = 
7/3.  In  this  case  LS  —  RS,  so  x  =  — 1/3  is  a  solution. 

Ifx=-5,  then  we  have  LS  =  |2(— 5)  +  3|  =  |  =  10  +  3|  =  |  -7|  =  7  md  RS  =  2-  (-5)  =  2  +  5  =  7. 
In  this  case  LS  —  RS,  so  x  =  —5  is  a  solution.  X 


We  next  look  at  absolute  values  and  inequalities. 
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Solution.  This  simply  means  —I<x  —  5<1.  Adding  5  to  each  gives  —2  <  x  <  12.  Therefore  the  solution 
is  the  interval  (—2, 12).  X 

In  some  questions  you  must  be  careful  when  multiplying  by  a  negative  number  as  in  the  next  problem. 


Solution.  This  simply  means  — 7  <  2  —  z  <  7.  Subtracting  2  gives:  —9<—z<  5.  Now  multiplying  by 
—  1  gives:  9  >  z  >  —5.  Remember  to  reverse  the  inequality  signs!  We  can  rearrange  this  as  — 5  <  z  <  9. 
Therefore  the  solution  is  the  interval  (—5,9).  X 


Solution.  Recall  that  for  a>  0,\x\>a  means  x  <  —a  or  x  >  a.  Thus,  either  2  —  z  <  — 7  or  2  —  z  >  7.  Either 
9  <  z  or  —  5  >  z.  Either  z>9  or  z<  —5.  In  interval  notation,  either  z  is  in  [9,°°)  or  z  is  in  (— °o,  —5].  All 
together,  we  get  our  solution  to  be:  (— ■ °°,  —5]  U  [9,°°).  X 

In  the  previous  two  examples  the  only  difference  is  that  one  had  <  in  the  question  and  the  other  had 
>.  Combining  the  two  solutions  gives  the  entire  real  number  line! 


Example  1.20:  Absolute  Value  Inequality 

Solve  0  < 

x  —  5 

<  7. 

Solution.  We  split  this  into  two  cases. 

(1)  For  0  <  |jc  —  5|  note  that  we  always  have  that  an  absolute  value  is  positive  or  zero  (i.e.,  0  <  \x  —  5\ 
is  always  true).  So,  for  this  part,  we  need  to  avoid  0  =  \x  —  5  |  from  occurring.  Thus,  x  cannot  be  5,  that  is, 
X  ^  5. 

(2)  For  |jc  —  5|  <  7,  we  have  —7  <  x  —  5  <7.  Adding  5  to  each  gives  —2  <  x  <  12.  Therefore  the 
solution  to  \x  —  5 1  <  7  is  the  interval  [—2, 12]. 

To  combine  (1)  and  (2)  we  need  combine  x  ^  5  with  x  6  [—2, 12] .  Omitting  5  from  the  interval  [—2, 12] 
gives  our  solution  to  be:  [—2, 5)  U  (5, 12] .  X 
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Exercises  for  1.1 


Exercise  1.1.1  Simplify  the  following  expressions  as  much  as  possible  assuming  x,y  >  0: 


(a) 

(b) 

(c) 


*V1/3 


l/y^x2 

3x~l/3y~2\/x4 


s/9xy  3 

1 6x2y 
x4 


2,A  1  ft  ^2 
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Exercise  1.1.2  Find  the  constants  a,b,c  if  the  expression 

4x-ly2yx 

2 Xy/y 

is  written  in  the  form  axbyc. 

Exercise  1.1.3  Find  the  roots  of  the  quadratic  equation 

x2-2x-24  =  0. 


Exercise  1.1.4  Solve  the  equation 

_ 2  =  _L 

4x— 16  x  —  3 


Exercise  1.1.5  Solve  the  following  inequalities.  Write  your  answer  as  a  union  of  intervals. 


(a)  3x+  1  >  6 

(b)  0  <  7x  —  1  <  1 


(c) 


x2(x—  1) 


(x  +  2)(x  +  3)3  _ 

(d)  x2  +  1  >  0 

(e)  x2  +  1  <  0 


<0 


(f)  x2  +  1  >  2x 

(g)  x3  >  Ax 


(h)  x3  >  Ax2 


(i)  ~  >2 

x 


(j) 


x  2 

- < - 

x  +  2  x—  1 


Exercise  1.1.6  Solve  the  equation  |6x  +  2|  =  1. 

Exercise  1.1.7  Find  solutions  to  the  following  absolute  value  inequalities.  Write  your  answer  as  a  union 
of  intervals. 
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(a)  x  >  2 

(d)  x  +  2  <  3x  — 6 

(b)  x  —  3  <  1 

(e)  2x  +  5  +4  >  1 

(c)  2x  +  5  >  4 

(f)  5  <  X+  1  <  8 

Exercise  1.1.8  Solve  the  equation  \/l  —  x  +  x  —  1. 


1.2  Analytic  Geometry 


In  what  follows,  we  use  the  notation  (xj,yi)  to  represent  a  point  in  the  (x,y)  coordinate  system,  also  called 
the  (x,y)-plane.  Previously,  we  used  (a,b)  to  represent  an  open  interval.  Notation  often  gets  reused  and 
abused  in  mathematics,  but  thankfully,  it  is  usually  clear  from  the  context  what  we  mean. 

In  the  ( x,y )  coordinate  system  we  normally  write  the  x-axis  horizontally,  with  positive  numbers  to  the 
right  of  the  origin,  and  the  y-axis  vertically,  with  positive  numbers  above  the  origin.  That  is,  unless  stated 
otherwise,  we  take  “rightward”  to  be  the  positive  x-direction  and  “upward”  to  be  the  positive  y-direction. 
In  a  purely  mathematical  situation,  we  normally  choose  the  same  scale  for  the  x-  and  y-axes.  For  example, 
the  line  joining  the  origin  to  the  point  (a,  a)  makes  an  angle  of  45°  with  the  x-axis  (and  also  with  the 
y-axis). 

In  applications,  often  letters  other  than  x  and  y  are  used,  and  often  different  scales  are  chosen  in  the 
horizontal  and  vertical  directions. 


Example  1.21:  Data  Plot 


Suppose  you  drop  a  coin  from  a  window,  and  you  want  to  study  how  its  height  above  the  ground 
changes  from  second  to  second.  It  is  natural  to  let  the  letter  t  denote  the  time  (the  number  of  seconds 
since  the  object  was  released)  and  to  let  the  letter  h  denote  the  height.  For  each  t  (say,  at  one-second 
intervals)  you  have  a  corresponding  height  h.  This  information  can  be  tabulated,  and  then  plotted 
on  the  ( t,h )  coordinate  plane,  as  shown  in  figure  1.1. 


We  use  the  word  “quadrant”  for  each  of  the  four  regions  into  which  the  plane  is  divided  by  the  axes: 
the  first  quadrant  is  where  points  have  both  coordinates  positive,  or  the  “northeast”  portion  of  the  plot,  and 
the  second,  third,  and  fourth  quadrants  are  counted  off  counterclockwise,  so  the  second  quadrant  is  the 
northwest,  the  third  is  the  southwest,  and  the  fourth  is  the  southeast. 

Suppose  we  have  two  points  A  and  B  in  the  (x,y) -plane.  We  often  want  to  know  the  change  in  x- 
coordinate  (also  called  the  “horizontal  distance”)  in  going  from  A  to  B.  This  is  often  written  Ax,  where 
the  meaning  of  A  (a  capital  delta  in  the  Greek  alphabet)  is  “change  in”.  Thus,  Ax  can  be  read  as  “change 
in  x”  although  it  usually  is  read  as  “delta  x”.  The  point  is  that  Ax'  denotes  a  single  number,  and  should  not 
be  interpreted  as  “delta  times  x”.  Similarly,  the  “change  in  y”  is  written  Ay  and  represents  the  difference 
between  the  y-coordinates  of  the  two  points.  It  is  the  vertical  distance  you  have  to  move  in  going  from  A 
to  B. 
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seconds 

0 

1 

2 

3 

4 
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80 

75.1 

60.4 

35.9 

1.6 

Figure  1.1:  A  data  plot,  height  versus  time. 


The  general  formulas  for  the  change  in  x  and  the  change  in  y  between  a  point  (xi,yi)  and  a  point 
(x2,y2)  are: 

Ax  =  x2-xi,  Ay  =  y2  —  y\- 

Note  that  either  or  both  of  these  might  be  negative. 

1.2.1.  Lines 


If  we  have  two  distinct  points  A(xi,yi)  and  i?(x2,y2),  then  we  can  draw  one  and  only  one  straight  line 
through  both  points.  By  the  slope  of  this  line  we  mean  the  ratio  of  Ay  to  Ax.  The  slope  is  often  denoted  by 
the  letter  m. 
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The  most  familiar  form  of  the  equation  of  a  straight  line  is: 

y  =  mx  +  b. 

Here  in  is  the  slope  of  the  line:  if  you  increase  x  by  1,  the  equation  tells  you  that  you  have  to  increase 
y  by  nr,  and  if  you  increase  x  by  Ax,  then  y  increases  by  Ay  =  in  Ax.  The  number  b  is  called  the  y- 
intercept,  because  it  is  where  the  line  crosses  the  y-axis  (when  x  =  0).  If  you  know  two  points  on  a  line, 
the  formula  m  =  (y2  —  yi)/(x2  —  xi)  gives  you  the  slope.  Once  you  know  a  point  and  the  slope,  then  the 
y-intercept  can  be  found  by  substituting  the  coordinates  of  either  point  in  the  equation:  y  i  =  mx\  +  b,  i.e., 
b  —  yi  —  mx | .  Alternatively,  one  can  use  the  “point-slope”  form  of  the  equation  of  a  straight  line:  start 
with  (y  —  y i )  /  (x  —  xi )  —  m  and  then  multiply  to  get 

(y-Jt)  =  m{x-x\), 

the  point-slope  form.  Of  course,  this  may  be  further  manipulated  to  get  y  =  mx  —  mx\  +  yj ,  which  is 
essentially  the  “y  =  mx  +  b”  form. 

It  is  possible  to  find  the  equation  of  a  line  between  two  points  directly  from  the  relation  m  —  (y  — 
yi)/(x  — xi)  =  (y2  —  yi)/(x2  —  xi),  which  says  “the  slope  measured  between  the  point  (xi,yi)  and  the 
point  (x2,y2)  is  the  same  as  the  slope  measured  between  the  point  (xi,yi)  and  any  other  point  (x,y)  on  the 
line.”  For  example,  if  we  want  to  find  the  equation  of  the  line  joining  our  earlier  points  A(2, 1)  and  B(3, 3), 
we  can  use  this  formula: 

y- 1  3-1 

in— - = - —  2,  so  that  y—  l  =  2(x  —  2),  i.e.,  y  =  2x  — 3. 

x  — 2  3  —  2 

Of  course,  this  is  really  just  the  point-slope  formula,  except  that  we  are  not  computing  in  in  a  separate  step. 
We  summarize  the  three  common  forms  of  writing  a  straight  line  below: 
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The  slope  m  of  a  line  in  the  form  y  —  mx  +  b  tells  us  the  direction  in  which  the  line  is  pointing.  If  m 
is  positive,  the  line  goes  into  the  1st  quadrant  as  you  go  from  left  to  right.  If  m  is  large  and  positive,  it 
has  a  steep  incline,  while  if  m  is  small  and  positive,  then  the  line  has  a  small  angle  of  inclination.  If  m  is 
negative,  the  line  goes  into  the  4th  quadrant  as  you  go  from  left  to  right.  If  m  is  a  large  negative  number 
(large  in  absolute  value),  then  the  line  points  steeply  downward.  If  m  is  negative  but  small  in  absolute 
value,  then  it  points  only  a  little  downward. 

If  m  =  0,  then  the  line  is  horizontal  and  its  equation  is  simply  y  —  b. 

All  of  these  possibilities  are  illustrated  below. 


There  is  one  type  of  line  that  cannot  be  written  in  the  formy  =  mx+b,  namely,  vertical  lines.  A  vertical 
line  has  an  equation  of  the  form  x  =  a.  Sometimes  one  says  that  a  vertical  line  has  an  “infinite”  slope. 

It  is  often  useful  to  find  the  x-intercept  of  a  line  y  —  mx  +  b.  This  is  the  x- value  when  y  =  0.  Setting 
mx  +  b  equal  to  0  and  solving  for  x  gives:  x  =  —b/m. 


It  is  often  necessary  to  know  if  two  lines  are  parallel  or  perpendicular.  Let  m\  and  mi  be  the  slopes  of 
the  nonvertical  lines  L\  and  L 2.  Then: 

•  L\  and  L2  are  parallel  if  and  only  if  77?  1  =  m2. 

•  L\  and  L2  are  perpendicular  if  and  only  if  mi  —  — .  (Equivalently,  m\  —  — ). 

7771  mi 
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In  the  case  of  perpendicular  lines,  we  say  their  slopes  are  negative  reciprocals.  Below  is  a  visual  represen¬ 
tation  of  a  pair  of  parallel  lines  and  a  pair  of  perpendicular  lines. 

Parallel  Lines  Perpendicular  Lines 


Example  1.25:  Equation  of  a  Line 


For  each  part  below,  find  an  equation  of  a  line  satisfying  the  requirements: 

(a)  Through  the  two  points  (0,3)  and  (—2,4). 

(b)  With  slope  7  and  through  point  (1,-2). 

(c)  With  slope  2  and  y-intercept  4. 

(d)  With  x-intercept  8  and  y-intercept  —3. 

(e)  Through  point  (5, 3)  and  parallel  to  the  line  2x  +  4y  +  2  =  0. 

2 

(f)  With  y-intercept  4  and  perpendicular  to  the  line  y  —  —  -x  +  3. 


Solution,  (a)  We  use  the  slope  formula  on  (xi,yi)  =  (0,3)  and  [x2,yi)  —  (—2,4)  to  find  m: 

(4)  -  (3)  11 

(-2)  -  (0)  -2  2' 

Now  using  the  point-slope  formula  we  get  an  equation  to  be: 

y-3  =  -^(*-o)  -»•  y  =  ~x+ 3. 

(b)  Using  the  point -slope  formula  with  m  =  7  and  (x\  ,yi )  =  (1,-2)  gives: 

y—  (—2)  =  l{x—  1)  ->•  y  —  lx  —  9. 

(c)  Using  the  slope-intercept  formula  with  m  =  2  and  b  —  4  we  get  y  —  2x  +  4. 

(d)  Note  that  the  intercepts  give  us  two  points:  (jq.yi)  =  (8,0)  and  {x2,yi)  =  (0,-3).  Now  follow  the 
steps  in  part  (a): 

— 3  —  0  _  3 
0-8  “  8' 


m  = 
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Using  the  point-slope  formula  we  get  an  equation  to  be: 

y-(-i)  =  ^(*-o)  y=\x~* * 3 * * * 7' 

(e)  The  line  2x  +  4  y  +  2  =  0  can  be  written  as: 

4y  —  —  2x  —  2  — >  y  =  —-x— -. 

2  2 

This  line  has  slope  —1/2.  Since  our  line  is  parallel  to  it,  we  have  m  —  —1/2.  Now  we  have  a  point 
(xi,yi)  =  (5,3)  and  slope  m  —  —1/2,  thus,  the  point-slope  formula  gives: 

y-3  =  -^(*-5). 


(f)  The  line  y  —  — -x  +  3  has  slope  m  —  —2/3.  Since  our  line  is  perpendicular  to  it,  the  slope  of  our  line 

is  the  negative  reciprocal,  hence,  m  —  3/2.  Now  we  have  b  —  4  and  m  —  3/2,  thus  by  the  slope-intercept 
formula,  an  equation  of  the  line  is 

3  „ 

y=-x  +  4. 


Example  1.26:  Parallel  and  Perpendicular  Lines 


Are  the  two  lines  lx  +  2y  +  3  =  0  and  6x  —  4y  +  2  —  0  perpendicular?  Are  they  parallel?  If  they  are 
not  parallel,  what  is  their  point  of  intersection? 


Solution.  The  first  line  is: 

7  3 

7x  +  2y  +  3  =  0  — >  2y  —  —lx—3  — *  y~~fX~2' 

It  has  slope  m\  —  —7/2.  The  second  line  is: 

3  1 

6x  —  4y  +  2  =  0  — >■  — 4y  =  — 6x  — 2  — »  y~2X^~2' 

It  has  slope  m2  —  3/2.  Since  m\  ■  mi  f  —  1  (they  are  not  negative  reciprocals),  the  lines  are  not  perpendic¬ 
ular.  Since  m\  f  m2  the  lines  are  not  parallel. 

We  find  points  of  intersection  by  setting  v-values  to  be  the  same  and  solving.  In  particular,  we  have 

7  3  3  1 

—  -x—  -  =  -x+  -. 


Solving  for  x  gives  x—  —2/5.  Then  substituting  this  into  either  equation  gives  y  =  —1/10.  Therefore,  the 
lines  intersect  at  the  point  (—2/5,  — 1/10).  X 
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1.2.2.  Distance  between  Two  Points  and  Midpoints 


Given  two  points  (jq, yi)  and  (x2,y2),  recall  that  their  horizontal  distance  from  one  another  is  Ax  =  X2  —  x\ 
and  their  vertical  distance  from  one  another  is  Ay  —  y2~yi-  Actually,  the  word  “distance”  normally 
denotes  “positive  distance”.  Ax  and  Ay  are  signed  distances,  but  this  is  clear  from  context.  The  (positive) 
distance  from  one  point  to  the  other  is  the  length  of  the  hypotenuse  of  a  right  triangle  with  legs  |Ax|  and 
|Ay|,  as  shown  in  figure  1.2.  The  Pythagorean  Theorem  states  that  the  distance  between  the  two  points  is 
the  square  root  of  the  sum  of  the  squares  of  the  horizontal  and  vertical  sides: 


Figure  1.2:  Distance  between  two  points  (here,  Ax  and  Ay  are  positive). 


As  a  special  case  of  the  distance  formula,  suppose  we  want  to  know  the  distance  of  a  point  (x,y)  to  the 
origin.  According  to  the  distance  formula,  this  is 


\J {x  -  0)2  +  (y  -  0)2  =  y/x2  +y2. 

A  point  (x,y)  is  at  a  distance  r  from  the  origin  if  and  only  if  y/x2  +  y2  =  r,  or,  if  we  square  both  sides: 
x2  4-y2  —  r2.  As  we  will  see,  this  is  the  equation  of  the  circle  of  radius,  r,  centered  at  the  origin. 

Furthermore,  given  two  points  we  can  determine  the  midpoint  of  the  line  segment  joining  the  two 
points. 
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r  1 

Midpoint  Formula 

The  midpoint  of  the  line  segment  joining  twc 
nates: 

midpoint  =  | 

i  points  (xi,yi)  and  {x2,yi)  is  the  point  with  coordi- 

(x\+x2  yi+y2\ 

l  2  ’  2  )■ 

Example  1.28:  Midpoint  of  a  Line  Segment 


Find  the  midpoint  of  the  line  segment  joining  the  given  points:  (1,0)  and  (5,  —2). 


Solution.  Using  the  midpoint  formula  on  (xi,yi)  =  (1,0)  and  (X2A2)  =  (5,  —2)  we  get: 


Thus,  the  midpoint  of  the  line  segment  occurs  at  (3,-1). 


1.2.3.  Conics 


X 


In  this  section  we  review  equations  of  parabolas,  circles,  ellipses  and  hyperbolas.  We  will  give  the  equa¬ 
tions  of  various  conics  in  standard  form  along  with  a  sketch.  A  useful  mnemonic  is  the  following. 


Mnemonic 


In  each  conic  formula  presented,  the  terms  ‘ x  —  h’  and  ‘y  —  k'  will  always  appear.  The  point  (h,k) 
will  alway  represent  either  the  centre  or  vertex  of  the  particular  conic. 


Note  that  h  or  k  (or  both)  may  equal  0. 

Vertical  Parabola:  The  equation  of  a  vertical  parabola  is: 

y  —  k  —  a(x  —  h )2 
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•  (h,k)  is  the  vertex  of  the  parabola.  •  If  a  >  0,  the  parabola  opens  upward, 

•a  is  the  vertical  stretch  factor.  •  If  a  <  0,  the  parabola  opens  downward. 


Horizontal  Parabola:  The  equation  of  a  horizontal  parabola  is: 

x  —  h  —  a{y  —  k )2 


•  (h,k)  is  the  vertex  of  the  parabola.  •  If  a  >  0,  the  parabola  opens  right. 

•  a  is  the  horizontal  stretch  factor.  •  If  a  <  0,  the  parabola  opens  left. 

Circle:  The  equation  of  a  circle  is: 


(; x-h)2  +  (y-k)2  =  r 2 


•  (h,k)  is  the  centre  of  the  circle.  •  r  is  the  radius  of  the  circle. 


(x-h)2  ,  (y-k)2 

O  '  7  0 


Ellipse:  The  equation  of  an  ellipse  is: 
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(h,k)  is  the  centre  of  the  ellipse. 

a  is  the  horizontal  distance  from  the  centre  to  the  edge  of  the  ellipse. 
b  is  the  vertical  distance  from  the  centre  to  the  edge  of  the  ellipse. 


Horizontal  Hyperbola:  The  equation  of  a  horizontal  hyperbola  is: 

(x-h)2  (y-k)2 


a 


b- 


( h,k )  is  the  centre  of  the  hyperbola. 


a,b  are  the  reference  box  values.  The  box  has 
a  centre  of  (h,k). 


=  1 

a  is  the  horizontal  distance  from  the  centre  to 
the  edge  of  the  box. 

b  is  the  vertical  distance  from  the  centre  to  the 
edge  of  the  box. 


Given  the  equation  of  a  horizontal  hyperbola,  one  may  sketch  it  by  first  placing  a  dot  at  the  point  (h,  k ) . 
Then  draw  a  box  around  (h,k)  with  horizontal  distance  a  and  vertical  distance  b  to  the  edge  of  the  box. 
Then  draw  dotted  lines  (called  the  asymptotes  of  the  hyperbola)  through  the  corners  of  the  box.  Finally, 
sketch  the  hyperbola  in  a  horizontal  direction  as  illustrated  below. 


y  + 


y  + 


(h,  k ) 


Vertical  Hyperbola:  The  equation  of  a  vertical  hyperbola  is: 

{x-h) 2  (y-k)2 


b 2 


-1 


{h,k)  is  the  centre  of  the  hyperbola. 


a  is  the  horizontal  distance  from  the  centre  to 
the  edge  of  the  box. 


•  a,  b  are  the  reference  box  values.  The  box  has  •  b  is  the  vertical  distance  from  the  centre  to  the 
a  centre  of  (h,k).  edge  of  the  box. 

Given  the  equation  of  a  vertical  hyperbola,  one  may  sketch  it  by  following  the  same  steps  as  with  a 
horizontal  hyperbola,  but  sketching  the  hyperbola  going  in  a  vertical  direction. 


y  + 


y  + 


(h,  k ) 
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Determining  the  Type  of  Conic 


An  equation  of  the  form 

Ax2  +  Bxy  +  Cy 2  +  Dx  +  Ey  +  F  =  0 

gives  rise  to  a  graph  that  can  be  generated  by  performing  a  conic  section  (parabolas,  circles,  ellipses, 
hyperbolas).  Note  that  the  Bxy  term  involves  conic  rotation.  The  Dx,  Ex,  and  F  terms  affect  the 
vertex  and  centre.  For  simplicity,  we  will  omit  the  Bxy  term.  To  determine  the  type  of  graph  we 
focus  our  analysis  on  the  values  of  A  and  C. 

•  If  A  =  C,  the  graph  is  a  circle. 

•  If  AC  >  0  (and  A  /  C),  the  graph  is  an  ellipse. 

•  If  AC  =  0,  the  graph  is  a  parabola. 

•  If  AC  <  0,  the  graph  is  a  hyperbola. 


Example  1.29:  Center  and  Radius  of  a  Circle 


Find  the  centre  and  radius  of  the  circle  y1  +  x2  —  1 2x  +  8y  +  43  =  0. 


Solution.  We  need  to  complete  the  square  twice,  once  for  the  x  terms  and  once  for  the  y  terms.  We’ll  do 
both  at  the  same  time.  First  let’s  collect  the  terms  with  x  together,  the  terms  with  y  together,  and  move  the 
number  to  the  other  side. 

(x2  —  12x)  +  (y2  +  8y)  =  —43 

We  add  36  to  both  sides  for  the  x  term  (—12  — »  —  —6  — »  (— 6)2  =  36),  and  16  to  both  sides  for  the  y 

term  (8  -►  §  =  4  -►  (4)2  =  16): 

(x2  —  12x  +  36)  +  (y2  +  8y  +  16)  =  -43  +  36  +  16 

Factoring  gives: 

(x-6)2  +  (y  +  4)2  =  32. 

Therefore,  the  centre  of  the  circle  is  (6,  —4)  and  the  radius  is  3.  X 


Solution.  Here  we  have  A  —  4  and  C  =  —  1.  Since  AC  <  0,  the  conic  is  a  hyperbola.  Let  us  complete  the 
square  for  the  x  and  y  terms.  First  let’s  collect  the  terms  with  x  together,  the  terms  with  y  together,  and 
move  the  number  to  the  other  side. 

(4x2  —  8x)  —  y2  =  —  8 

Now  we  factor  out  4  from  the  x  terms. 


4(x2  —  2x)  —  y2  =  — 8 
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Notice  that  we  don’t  need  to  complete  the  square  for  the  y  terms  (it  is  already  completed!).  To  complete 
the  square  for  the  x  terms  we  add  1  since  —2  — *  ^  =  —  1  — >■  (— 1)~  =  1,  taking  into  consideration  that  the 
a  value  is  4: 

4(x2-2x+l)-y2  =  -8  +  4-1 

Factoring  gives: 

4(x  —  l)2  —y2  —  —4 

A  hyperbola  in  standard  form  has  ±1  on  the  right  side  and  a  positive  x2  on  the  left  side,  thus,  we  must 
divide  by  4: 


Now  we  can  see  that  the  equation  represents  a  vertical  hyperbola  with  centre  (1,0)  (and  with  a  value 
\/T  =  1,  and  b  value  \f\  =  2).  X 


Example  1.31:  Equation  of  Parabola 


Find  an  equation  of  the  parabola  with  vertex  (1,-1)  that  passes  through  the  points  (—4,24)  and 
(7,35). 


Solution.  We  first  need  to  determine  if  it  is  a  vertical  parabola  or  horizontal  parabola.  See  figure  1.3  for 
a  sketch  of  the  three  points  (1,-1),  (—4,24)  and  (7,35)  in  the  xy-plane.  Note  that  the  vertex  is  (1,-1). 

y 


Figure  1.3:  Figure  for  Example  1.31 

Given  the  location  of  the  vertex,  the  parabola  cannot  open  downwards.  It  also  cannot  open  left  or  right 
(because  the  vertex  is  between  the  other  two  points  -  if  it  were  to  open  to  the  right,  every  other  point  would 
need  to  be  to  the  right  of  the  vertex;  if  it  were  to  open  to  the  left,  every  other  point  would  need  to  be  to 
the  left  of  the  vertex).  Therefore,  the  parabola  must  open  upwards  and  it  is  a  vertical  parabola.  It  has  an 
equation  of 

y  —  k  —  a(x  —  h)2. 

As  the  vertex  is  (h,k)  —  (1,  —  1)  we  have: 

y-(-!)  =a{x- l)2 

To  determine  a ,  we  substitute  one  of  the  points  into  the  equation  and  solve.  Let  us  substitute  the  point 
(x,y)  =  (—4,24)  into  the  equation: 

24  —  (  —  1)  =  a(—4  —  l)2  -A  25  =  25 a  ->■  a=  1. 
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Therefore,  the  equation  of  the  parabola  is: 


3;+l  =  (*-l)2. 

Note  that  if  we  substituted  (7, 35)  into  the  equation  instead,  we  would  also  get  a  —  1. 


X 


Exercises  for  1.2 


Exercise  1.2.1  Find  the  equation  of  the  line  in  the  form  y  =  mx  +  b: 

(a)  through  (1,1)  and  (-5,-3) 

(b)  through  (—1,2)  with  slope  —2 

(c)  through  (—1,1)  and  (5,—  3) 

(d)  through  (2, 5)  and  parallel  to  the  line  3x  +  9y  +  6  =  0 

( e)  with  x-intercept  5  and  perpendicular  to  the  line  y  —  2x  + 4 


Exercise  1.2.2  Change  the  following  equations  to  the  form  y  —  mx  +  b,  graph  the  line,  and  find  the  y- 
intercept  and  x-intercept. 

(a)  y  —  2x  —  2 

(b)  x  +  y  =  6 

(c)  x  —  2y—l 

(d)  3  —  2 y 

(e)  2x  +  3y  +  6  =  0 


Exercise  1.2.3  Determine  whether  the  lines  3x  +  6y  =  7  and  2x  +  4 y  =  5  are  parallel. 

Exercise  1.2.4  Suppose  a  triangle  in  the  ( x,y)-plane  has  vertices  (—1,0),  (1,0)  and  (0,2).  Find  the 
equations  of  the  three  lines  that  lie  along  the  sides  of  the  triangle  in  y  —  mx  +  b  form. 

Exercise  1.2.5  Let  x  stand  for  temperature  in  degrees  Celsius  (centigrade),  and  let  y  stand  for  temperature 
in  degrees  Fahrenheit.  A  temperature  ofO°C  corresponds  to  32° F,  and  a  temperature  of  100°  C  corresponds 
to  212 °F.  Find  the  equation  of  the  line  that  relates  temperature  Fahrenheit  y  to  temperature  Celsius  x  in 
the  form  y  =  mx  +  b.  Graph  the  line,  and  find  the  point  at  which  this  line  intersects  y  —  x.  What  is  the 
practiced  meaning  of  this  point? 
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Exercise  1.2.6  A  car  rental  firm  has  the  following  charges  for  a  certain  type  of  car:  $25  per  day  with  100 
free  miles  included,  $0.15  per  mile  for  more  than  100  miles.  Suppose  you  want  to  rent  a  car  for  one  day, 
and  you  know  you’ll  use  it  for  more  than  100  miles.  What  is  the  equation  relating  the  costy  to  the  number 
of  miles  x  that  you  drive  the  car? 

Exercise  1.2.7  A  photocopy  store  advertises  the  following  prices:  5c  per  copy  for  the  first  20  copies, 
4c  per  copy  for  the  21st  through  100th  copy,  and  3c  per  copy  after  the  100th  copy.  Let  x  be  the  number  of 
copies,  and  lety  be  the  total  cost  of  photocopying,  (a)  Graph  the  cost  as  x  goes  from  0  to  200  copies.  ( b ) 
Find  the  equation  in  the  form  y  =  mx  +  b  that  tells  you  the  cost  of  making  x  copies  when  x  is  more  than 
100. 

Exercise  1.2.8  Market  research  tells  you  that  if  you  set  the  price  of  an  item  at  $1.50,  you  will  be  able  to 
sell  5000  items;  and  for  every  10  cents  you  lower  the  price  below  $1.50  you  will  be  able  to  sell  another 
1000  items.  Let  x  be  the  number  of  items  you  can  sell,  and  let  P  be  the  price  of  an  item,  (a)  Express  P 
linearly  in  terms  of  x,  in  other  words,  express  P  in  the  form  P  =  mx  +  b.  (b)  Express  x  linearly  in  terms  of 

P. 

Exercise  1.2.9  An  instructor  gives  a  100-point  final  exam,  and  decides  that  a  score  90  or  above  will  be 
a  grade  of  4.0,  a  score  of  40  or  below  will  be  a  grade  of  0.0,  and  between  40  and  90  the  grading  will 
be  linear.  Let  x  be  the  exam  score,  and  let  y  be  the  corresponding  grade.  Find  a  formula  of  the  form 
y  =  mx  +  b  which  applies  to  scores  x  between  40  and  90. 

Exercise  1.2.10  Find  the  distance  between  the  pairs  of  points: 

( a)  (—1, 1)  and  (1, 1). 

(b)  (5,3)  and  (— 7,—  2). 

(c)  (1,1)  and  the  origin. 

Exercise  1.2.11  Find  the  midpoint  of  the  line  segment  joining  the  point  (20,-10)  to  the  origin. 


Exercise  1.2.12  Find  the 

equation  of  the  circle  of  radius  3  centered  at: 

(a)  (0,0) 

(d)  (0,3) 

(b)  (5,6) 

(e)  (0,-3) 

(c)  (-5,-6) 

if)  (3,0) 

Exercise  1.2.13  For  each  pair  of  points  A(xi,yi)  and  B{x2,yi)  find  an  equation  of  the  circle  with  center 
at  A  that  goes  through  B. 

(a)  A(2,0),  5(4,3)  (b)  A(- 2,3),  5(4,3) 


Exercise  1.2.14  Determine  the  type  of  conic  and  sketch  it. 
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(a)  x 2  +  y2  +  lOy  =  0 

(b)  9x2-90x  +  y2  +  81  =0 

(c)  6x  +  y2  —  8>’  =  0 


Exercise  1.2.15  Find  the  standard  equation  of  the  circle  passing  through  (—2, 1)  and  tangent  to  the  line 
3x  —  2y  —  6  at  the  point  (4,3)-  Sketch.  (Hint:  The  line  through  the  center  of  the  circle  and  the  point  of 
tangency  is  perpendicular  to  the  tangent  line.) 


1.3  Trigonometry 

In  this  section  we  review  the  definitions  of  trigonometric  functions. 

1.3.1.  Angles  and  Sectors  of  Circles 

Mathematicians  tend  to  deal  mostly  with  radians  and  we  will  see  later  that  some  formulas  are  more  elegant 
when  using  radians  (rather  than  degrees).  The  relationship  between  degrees  and  radians  is: 

Tl  rad  =  180°. 

Using  this  formula,  some  common  angles  can  be  derived: 


Degrees 

0° 

O 

o 

CO 

45° 

60° 

o 

o 

120° 

135° 

150° 

180° 

270° 

360° 

Radians 

0 

K 

71 

71 

71 

27 r 

371 

571 

71 

3  71 

2  71 

6 

4 

3 

2 

3 

4 

6 

2 

From  now  on,  unless  otherwise  indicated,  we  will  always  use  radian  measure. 
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In  the  diagram  below  is  a  sector  of  a  circle  with  central  angle  9  and  radius  r  subtending  an  arc  with 
length  s. 


When  9  is  measure  in  radians,  we  have  the  following  formula  relating  9,  s  and  r. 

$ 

6  —  -  or  s  —  rO. 
r 


Sector  Area 

The  area  of  the  sector  is  equal  to: 

1  ? 

Sector  Area  =  -r~9. 

2 

Example  1.34:  Angle  Subtended  by  Arc 


If  a  circle  has  radius  3  cm,  then  an  angle  of  2  rad  is  subtended  by  an  arc  of  6  cm  (s  —  rO  =  3-2  —  6). 


Example  1.35:  Area  of  Circle 


If  we  substitute  9 
circle: 


2 n  (a  complete  revolution)  into  the  sector  area  formula  we  get  the  area  of  a 


A  =  -r2(  2k)  =  nr 2 


1.3.2.  Trigonometric  Functions 


•  Tangent  (abbreviated  by  tan) 

•  Cosecant  (abbreviated  by  esc) 


There  are  six  basic  trigonometric  functions: 

•  Sine  (abbreviated  by  sin) 

•  Cosine  (abbreviated  by  cos) 
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•  Secant  (abbreviated  by  sec)  •  Cotangent  (abbreviated  by  cot) 

We  first  describe  trigonometric  functions  in  terms  of  ratios  of  two  sides  of  a  right  angle  triangle 
containing  the  angle  0 . 


Opposite 

(opp) 


With  reference  to  the  above  triangle,  for  an  acute  angle  6  (that  is,  0  <  0  <  tt/2),  the  six  trigonometric 
functions  can  be  described  as  follows: 


sin0 


OPP 

hyp 


cos  6 


tan0 


adj 

hyp 

OPP 

adj 


esc  0 

sec  6 

cot  6 


hyp 

OPP 

hyp 

adj 

adj 

opp 


Notice  that  sin  is  the  ratio  of  the  opposite  and  hypotenuse.  We  use  the  mnemonic  SOH  to  remember 
this  ratio.  Similarly,  CAH  and  TOA  remind  us  of  the  cos  and  tan  ratios. 


Mnemonic 


The  mnemonic  SOH  CAH  TOA  is  useful  in  remembering  how  trigonometric  functions  of  acute 
angles  relate  to  the  sides  of  a  right  triangle. 


This  description  does  not  apply  to  obtuse  or  negative  angles.  To  define  the  six  basic  trigonometric 
functions  we  first  define  sine  and  cosine  as  the  lengths  of  various  line  segments  from  a  unit  circle,  and 
then  we  define  the  remaining  four  basic  trigonometric  functions  in  terms  of  sine  and  cosine. 

Take  a  line  originating  at  the  origin  (making  an  angle  of  0  with  the  positive  half  of  the  x-axis)  and 
suppose  this  line  intersects  the  unit  circle  at  the  point  (x,y).  The  x-  and  y-coordinates  of  this  point  of 
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intersection  are  equal  to  cos  0  and  sin  0,  respectively. 


For  angles  greater  than  2k  or  less  than  —2k,  simply  continue  to  rotate  around  the  circle.  In  this  way,  sine 
and  cosine  become  periodic  functions  with  period  2k: 

sin0  =  sin(0  +2Kk)  cos  0  =  cos  (0  +  2kJc) 


for  any  angle  0  and  any  integer  k. 

Above,  only  sine  and  cosine  were  defined  directly  by  the  circle.  We  now  define  the  remaining  four 
basic  trigonometric  functions  in  terms  of  the  functions  sin  0  and  cos  0: 


tan0 


sin0 

COS0 


sec0 


1 

cos  0 


CSC  0 


1 

sin0 


cot0 


COS0 

sin0 


1.3.3.  Computing  Exact  Trigonometric  Ratios 


The  unit  circle  is  often  used  to  determine  the  exact  value  of  a  particular  trigonometric  function. 
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Reading  from  the  unit  circle  one  can  see  that  cos5/r/6  =  —  \/3/2  and  sin57r/6  =1/2  (remember  the 
that  the  ^-coordinate  is  cos  0  and  the  y-coordinate  is  sin0).  However,  we  don’t  always  have  access  to  the 
unit  circle.  In  this  case,  we  can  compute  the  exact  trigonometric  ratios  for  0  —  5k/ 6  by  using  special 
triangles  and  the  CAST  rule  described  below. 

The  first  special  triangle  has  angles  of  45°, 45°, 90°  (i.e.,  k/4,k/4,k/2)  with  side  lengths  1,1, y/2, 
while  the  second  special  triangle  has  angles  of  30°,  60°,  90°  (i.e.,  k/6,k/5,k/2)  with  side  lengths  1,2,  \/3- 
They  are  classically  referred  to  as  the  l-l-y/2  triangle,  and  the  1 -2- \/3  triangle,  respectively,  shown  below. 


1 
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Mnemonic 


The  first  triangle  should  be  easy  to  remember.  To  remember  the  second  triangle,  place  the  largest 
number  (2)  across  from  the  largest  angle  (90°  =  it/ 2).  Place  the  smallest  number  (1)  across  from 
the  smallest  angle  (30°  —  k/ 6).  Place  the  middle  number  (\/3  ~  1.73)  across  from  the  middle  angle 
(60°  —  it/ 3).  Double  check  using  the  Pythagorean  Theorem  that  the  sides  satisfy  a2  +  b2  —  c2. 


The  special  triangles  allow  us  to  compute  the  exact  value  (excluding  the  sign)  of  trigonometric  ratios, 
but  to  determine  the  sign,  we  can  use  the  CAST  rule. 


The  CAST  Rule 


The  CAST  rule  says  that  in  quadrant  I  all  three  of  sin  0,  cos  0,  tan0  are  positive.  In  quadrant  II, 
only  sin0  is  positive,  while  cos0,  tan0  are  negative.  In  quadrant  III,  only  tan0  is  positive,  while 
sin0,  cos0  are  negative.  In  quadrant  IV,  only  cos0  is  positive,  while  sin0,  tan0  are  negative.  To 
remember  this,  simply  label  the  quadrants  by  the  letters  C-A-S-T  starting  in  the  bottom  right  and 
labelling  counter-clockwise. 


sin#  >  0 

All  >  0 

S 

A 

II 

I 

III 

IV 

T 

c 

tan  9  >  0 

cos  9  >  0 

Example  1.36:  Determining  Trigonometric  Ratios  Without  Unit  Circle 


Determine  sin57r/6,  cos57r/6,  tan57r/6,  sec5;r/6,  csc5tt/6  and  cot57r/6  exactly  by  using  the  spe¬ 
cial  triangles  and  CAST  rule. 


Solution.  We  start  by  drawing  the  xy-plane  and  indicating  our  angle  of  5^/6  in  standard  position  (positive 
angles  rotate  counterclockwise  while  negative  angles  rotate  clockwise ).  Next,  we  drop  a  perpendicular  to 
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the  x-axis  (never  drop  it  to  the  y-axis!). 


Notice  that  we  can  now  figure  out  the  angles  in  the  triangle.  Since  180°  =  n,  we  have  an  interior  angle  of 
K  —  5k/6  =  k/6  inside  the  triangle.  As  the  angles  of  a  triangle  add  up  to  180°  =  K,  the  other  angle  must 
be  k/3.  This  gives  one  of  our  special  triangles.  We  label  it  accordingly  and  add  the  CAST  rule  to  our 
diagram. 


From  the  above  figure  we  see  that  5tt/6  lies  in  quadrant  II  where  sin  0  is  positive  and  cos  0  and  tan  0  are 
negative.  This  gives  us  the  sign  of  sin  0,  cos  0  and  tan  0.  To  determine  the  value  we  use  the  special  triangle 
and  SOH  CAH  TOA. 

Using  sin  0  —  opp/hyp  we  find  a  value  of  1/2.  Since  sin  0  is  positive  in  quadrant  II,  we  have 

.  5k  1 

Sm-  =  +-. 

Using  cos  9  —  ad  j/hyp  we  find  a  value  of  y/3/2.  But  cos  0  is  negative  in  quadrant  II,  therefore, 

5k  a/3 

cos  —  =  — — . 


Using  tan  0  =  opp/ad j  we  find  a  value  of  1/ y/3.  But  tan  0  is  negative  in  quadrant  II,  therefore, 


1 

7!' 
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To  determine  sec  0,  esc  0  and  cot  0  we  use  the  definitions: 


1 


+2, 


5  n  1  2 

sec  = - t = , 

6  cos  ^  y/3 


1 

tan^ 


-y/3. 


X 


Solution.  We  first  draw  a  right  angle  triangle.  Since  cos  0  —  ad  j /hyp  =  3/7,  we  let  the  adjacent  side  have 
length  3  and  the  hypotenuse  have  length  7. 


Using  the  Pythagorean  Theorem,  we  have  32  +  (opp)2 


72.  Thus,  the  opposite  side  has  length  \^40. 


cot0 


1 

tan0 


°PP  =  V40 


Since  we  are  given  37t/2  <  0  <  2 7T,  we  are  in  the  fourth  quadrant.  By  the  CAST  rule,  tan0  is  negative  in 
this  quadrant.  As  tan  9  —  opp /ad j ,  it  has  a  value  of  n/40/3,  but  by  the  CAST  rule  it  is  negative,  that  is, 


Therefore, 


tan0  = 


V40 

~ir 


COt0  = 


3 

\/40' 


X 
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1.3.4.  Graphs  of  Trigonometric  Functions 


The  graph  of  the  functions  sinx  and  cosx  can  be  visually  represented  as: 


Both  sinx  and  cosx  have  domain  (—  oo5oo)  and  range  [—1,1].  That  is, 

—  1  <  sinx  <1  —  1  <  cosx  <  1. 


The  zeros  of  sinx  occur  at  the  integer  multiples  of  Jt,  that  is,  sinx  =  0  whenever  x  =  mi,  where  n  is  an 
integer.  Similarly,  cosx  =  0  whenever  x  =  n/2  +  nn,  where  n  is  an  integer. 

The  six  basic  trigonometric  functions  can  be  visually  represented  as: 


sine  - 

cosine  - 

tangent  - 

cosecant - 

secant - 

cotangent - 


Both  tangent  and  cotangent  have  range  (—00,00),  whereas  cosecant  and  secant  have  range  (—  °°,  —  1]  U 
[1,0°).  Each  of  these  functions  is  periodic.  Tangent  and  cotangent  have  period  n,  whereas  sine,  cosine, 
cosecant  and  secant  have  period  2n. 


1.3.5.  Trigonometric  Identities 


There  are  numerous  trigonometric  identities,  including  those  relating  to  shift/periodicity,  Pythagoras  type 
identities,  double-angle  formulas,  half-angle  formulas  and  addition  formulas.  We  list  these  below. 
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1 .  Shifts  and  periodicity 


sin(0  +  27r)  =  sin0 

cos(0  +  27r)  =  COS0 

tan(0+27r)  =  tan0 

sin(0  +  7r)  =  —  sin0 

cos(0  +  7r)  =  —cos  0 

tan(0  +  7r)  =  tan0 

sin(— 0)  =  —  sin0 

cos( — 0)  =  COS0 

tan(— 0)  =  —  tan0 

sin(^  — 0)  =  cos0 

cos(^  —  0)  = 

tan  ^  —  0^  =  cot0 

2.  Pythagoras  type  formulas 

•  sin2  0  +  cos2  0  =  1 

•  tan2  0  +  1  =  sec2  0 

•  1  +  cot2  0  =  esc2  0 


4.  Half-angle  formulas 

o  l+cos(20) 

•  cos2  0  =  — -e — - 
2 

.  9  1  —  cos(20) 

.  sin-  0  = - -2 — 


5.  Addition  formulas 


3.  Double-angle  formulas 

•  sin(20)  =  2  sin  0  cos  0 

cos(2 0)  =  cos2  0  — sin2  0 

•  =  2cos20  — 1 
=  1  —  2  sin2  0 . 


•  sin(0  +  0) 

•  cos  (0  +  0) 

•  tan(0  +  0) 

•  sin(0  —  (j>) 

•  COS  (0  —  0) 


sin  0  cos  (/)  +  cos  0  sin  ( p 
cos  0  cos  0  —  sin  0  sin  0 
tan  0  +  tan  0 
1  —  tan  0  tan  < p 
sin  0  cos  0  —  cos  0  sin  0 
cos  0  cos  (f>  +  sin  0  sin  0 


Solution.  Using  the  double-angle  formula  sin2x  =  2sinxcosx  we  have: 

2sinxcosx  =  sinx 

2  sinxcosx  —  sinx  =  0 
sinx(2cosx—  1)  =  0 

Thus,  either  sinx  =  0  or  cosx  =1/2.  For  the  first  case  when  sinx  =  0,  we  get  x  =  0  or  x  —  K.  For  the 
second  case  when  cosx  =  1/2,  we  get  x  =  n/3  (use  the  special  triangles  and  CAST  rule  to  get  this).  Thus, 
we  have  three  solutions:  x  =  0,  x  =  tt/3,  x  —  K.  X 
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Exercises  for  1.3 


Exercise  1.3.1  Find  all  values  ofO  such  that  sin(0)  =  —1;  give  your  answer  in  radians. 
Exercise  1.3.2  Find  all  values  ofO  such  that  cos(2  0)  =  1/2;  give  your  answer  in  radians. 
Exercise  1.3.3  Compute  the  following: 

(a)  sin(37r)  (d)  csc(47t/3) 

( b)  sec(57r/6)  (e)  tan(77r/4) 

(c)  cos(— n/3)  (f)  cot(137r/4) 

Exercise  1.3.4  If  sin  0  —  |  and  §  <  0  <  k,  then  find  sec  0. 

Exercise  1.3.5  Suppose  that  tan  0  —  x  and  n  <  0  <  find  sin  0  and  cos  6  in  terms  ofx. 
Exercise  1.3.6  Find  an  angle  0  such  that  and  sin  0  =  sin 

Exercise  1.3.7  Use  an  angle  sum  identity  to  compute  cos(7r/ 1 2). 

Exercise  1.3.8  Use  an  angle  sum  identity  to  compute  tan(57r/12). 

Exercise  1.3.9  Verify  the  following  identities 

(a)  cos2(t)/(l  —  sin(f))  =  l  +  sin(t) 

(b)  2csc(20)  —  sec(6)  csc(0) 

(c)  sin(30)  —  sin(0)  =  2cos(20) sin(0) 

Exercise  1.3.10  Sketch  the  following  functions: 

(a)  y  —  2sin(x) 

(b)  y  —  sin(3x) 

(c)  y  —  sin(—  x) 

Exercise  1.3.11  Find  all  of  the  solutions  o/’ 2  sin  (4)  —  1  —  sin2  (/ )  =  0  in  the  interval  [0, 2n\. 
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1.4  Additional  Exercises 


These  problems  require  a  comprehensive  knowledge  of  the  skills  reviewed  in  this  chapter.  They  are  not 
in  any  particular  order.  A  proficiency  in  these  skills  will  help  you  a  long  way  as  your  learn  the  calculus 
material  in  the  following  chapters. 

Exercise  1.4.1  Rationalize  the  denominator  for  each  of  the  following  expressions.  That  is,  re-write  the 
expression  in  such  a  way  that  no  square  roots  appear  in  the  denominator.  Also,  simplify  your  answers  if 
possible. 


(a) 


1 

72 


(b) 


3  h 

\/.\  h  1  —  \/x  A  1 


Exercise  1.4.2  Solve  the  following  equations. 

(a)  2  —  5(x  —  3)  =  4—  lOx 

(b)  2x2  —  5x  —  3 

(c)  x2  —x  — 3  —  0 

(d)  x2  +  x  +  3  =  0 

(e)  Vx2  +  9  =  2x 


Exercise  1.4.3  By  means  of  counter-examples,  show  why  it  is  wrong  to  say  that  the  following  equations 
hold  for  all  real  numbers  for  which  the  expressions  are  defined. 


(a)  (x  —  2)2=x2  —  22 


(b) 


1 

x  +  h 


1 

- h 

X 


1 

h 


(c)  y/x2  +  y2  =x  +  y 


Exercise  1.4.4  Find  an  equation  of  the  line  passing  through  the  point  (—2,5)  and  parallel  to  the  line 
x  +  3y  —  2  —  0. 

x2  —  1 

Exercise  1.4.5  Solve  - - <  1. 

3x  —  1  “ 


Exercise  1.4.6  Explain  why  the  following  expression  never  represents  a  real  number  ( for  any  real  number 
x):  Vx  —  2  +  \/T  —  x. 
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Exercise  1.4.7  Simplify  the  expression 


[3  (x  +  /?)  +  4]  —  [3x2  +  4] 


as  much  as  possible. 


x+h 


Exercise  1.4.8  Simplify  the  expression  ~^x+ll'>  1 - — -  as  much  as  possible. 

h 

Exercise  1.4.9  Simplify  the  expression  —  sinx(cosx  +  3  sinx)  —  cos x(—  sinx  +  3  cosx). 
Exercise  1.4.10  Solve  the  equation  cosx  =  ^  on  the  interval  0  <  x  <  2k. 

Exercise  1.4.11  Find  an  angle  6  such  that  0  <  0  <  K  and  cos  9  =  cos  ^y-. 

|x|  + 14  — x| 


Exercise  1.4.12  What  can  you  say  about 


x  — 2 


when  x  is  a  large  (positive)  number? 


Exercise  1.4.13  Find  an  equation  of  the  circle  with  centre  in  (—2,3)  and  passing  through  the  point 

(l.-l). 

Exercise  1.4.14  Find  the  centre  and  radius  of  the  circle  described  by  x2  +y2  +  6x  —  4_y  +12  =  3. 
Exercise  1.4.15  Ify  —  9x2  +  6x  +  7,  find  all  possible  values  ofy. 


Exercise  1.4.16  Simplify 


3x2y3z  1 x 
1 8x  1  vz3 


3x  +  2 

Exercise  1.4.17  Ify  — - — ,  then  what  is  x  in  terms  ofy? 

1  -  4x 

Exercise  1.4.18  Divide  x2  +  3x  —  5  by  x  +  2  to  obtain  the  quotient  and  the  remainder.  Equivalently,  find 
polynomial  Q(x)  and  constant  R  such  that 

x2  +  3x  —  5  R 

=  Q{x)  + 


x  4“  2 


x  4~  2 


2.  Functions 


2.1  What  is  a  Function? 


A  function  y  —  f(x)  is  a  rule  for  determining  y  when  we’re  given  a  value  of  x.  For  example,  the  rule 
y  —  f{x)  —  2x  +  1  is  a  function.  Any  line  y  =  mx  +  b  is  called  a  linear  function.  The  graph  of  a  function 
looks  like  a  curve  above  (or  below)  the  x-axis,  where  for  any  value  of  x  the  rule  y  —  f{x)  tells  us  how  far 
to  go  above  (or  below)  the  x-axis  to  reach  the  curve. 

Functions  can  be  defined  in  various  ways:  by  an  algebraic  formula  or  several  algebraic  formulas,  by 
a  graph,  or  by  an  experimentally  determined  table  of  values.  In  the  latter  case,  the  table  gives  a  bunch  of 
points  in  the  plane,  which  we  might  then  interpolate  with  a  smooth  curve,  if  that  makes  sense. 

Given  a  value  of  x,  a  function  must  give  at  most  one  value  of  y.  Thus,  vertical  lines  are  not  functions. 
For  example,  the  line  x  =  1  has  infinitely  many  values  of  y  if  x  =  1 .  It  is  also  true  that  if  x  is  any  number 
(not  1)  there  is  no  y  which  corresponds  to  x,  but  that  is  not  a  problem — only  multiple  y  values  is  a  problem. 

One  test  to  identify  whether  or  not  a  curve  in  the  (x,y)  coordinate  system  is  a  function  is  the  following. 


Theorem  2.1:  The  Vertical  Line  Test 


A  curve  in  the  (x,y)  coordinate  system  represents  a  function  if  and  only  if  no  vertical  line  intersects 
the  curve  more  than  once. 


In  addition  to  lines,  another  familiar  example  of  a  function  is  the  parabola  y  =  /(x)  =  x2.  We  can  draw 
the  graph  of  this  function  by  taking  various  values  of  x  (say,  at  regular  intervals)  and  plotting  the  points 
(x,/(x))  =  (x,x2).  Then  connect  the  points  with  a  smooth  curve.  (See  figure  2.1.) 

The  two  examples  y  =  /(x)  =  2x  +  1  and  y  —  f(x)  —  x2  are  both  functions  which  can  be  evaluated  at 
any  value  of  x  from  negative  infinity  to  positive  infinity.  For  many  functions,  however,  it  only  makes  sense 
to  take  x  in  some  interval  or  outside  of  some  “forbidden”  region.  The  interval  of  x- values  at  which  we’re 
allowed  to  evaluate  the  function  is  called  the  domain  of  the  function. 
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y  =  fix)  =  x2  y  =  f{x )  =  yfx 

Figure  2.1:  Graphs  of  Functions 


Example  2.2:  Domain  of  the  Square-Root  Function 


The  square-root  function  y  —  f(x)  —  yfx  is  the  rule  which  says,  given  an  x-value,  take  the  nonneg¬ 
ative  number  whose  square  is  x.  This  rule  only  makes  sense  if  x  >  0.  We  say  that  the  domain  of 
this  function  is  x  >  0,  or  more  formally  {x  €  M  :  x  >  0}.  Alternately,  we  can  use  interval  notation, 
and  write  that  the  domain  is  [0,°°).  The  fact  that  the  domain  ofy—  yfx  is  [0,°°)  means  that  in  the 
graph  of  this  function  (see  figure  2.1)  we  have  points  (x,  y)  only  above  x-values  on  the  right  side  of 
the  x-axis. 


Another  example  of  a  function  whose  domain  is  not  the  entire  x-axis  is:  y  —  f(x)  —  1/x,  the  reciprocal 
function.  We  cannot  substitute  x  =  0  in  this  formula.  The  function  makes  sense,  however,  for  any  nonzero 
x,  so  we  take  the  domain  to  be:  {xel  :  xf  0}.  The  graph  of  this  function  does  not  have  any  point  (x,_y) 
with  x  =  0.  As  x  gets  close  to  0  from  either  side,  the  graph  goes  off  toward  infinity.  We  call  the  vertical 
line  x  =  0  an  asymptote. 

To  summarize,  two  reasons  why  certain  x-values  are  excluded  from  the  domain  of  a  function  are  the 
following. 


Restrictions  for  the  Domain  of  a  Function 


1 .  We  cannot  divide  by  zero,  and 

2.  We  cannot  take  the  square  root  of  a  negative  number. 


When  the  domain  of  a  function  is  restricted,  we  say  that  the  function  is  undefined  at  that  point.  We 
will  encounter  some  other  ways  in  which  functions  might  be  undefined  later. 

Another  reason  why  the  domain  of  a  function  might  be  restricted  is  that  in  a  given  situation  the  x-values 
outside  of  some  range  might  have  no  practical  meaning.  For  example,  if  y  is  the  area  of  a  square  of  side  x, 
then  we  can  write  y  —  f(x)  —x2.  In  a  purely  mathematical  context  the  domain  of  the  function  y  =  x2  is  all 
of  R.  However,  in  the  story-problem  context  of  finding  areas  of  squares,  we  restrict  the  domain  to  positive 
values  of  x,  because  a  square  with  negative  or  zero  side  makes  no  sense. 

In  a  problem  in  pure  mathematics,  we  usually  take  the  domain  to  be  all  values  of  x  at  which  the 
formulas  can  be  evaluated.  However,  in  a  story  problem  there  might  be  further  restrictions  on  the  domain 
because  only  certain  values  of  x  are  of  interest  or  make  practical  sense. 


2. 1 .  What  is  a  Function?  47 


In  a  story  problem,  we  often  use  letters  other  than  x  and  y.  For  example,  the  volume  V  of  a  sphere 
is  a  function  of  the  radius  r,  given  by  the  formula  V  =  f(r)  —  | nr 3.  Also,  letters  different  from  /  may 
be  used.  For  example,  if  y  is  the  velocity  of  something  at  time  t,  we  may  write  y  —  v(t  )  with  the  letter  v 
(instead  of  /)  standing  for  the  velocity  function  (and  t  playing  the  role  of  x). 

The  letter  playing  the  role  of  x  is  called  the  independent  variable,  and  the  letter  playing  the  role  of  y 
is  called  the  dependent  variable  (because  its  value  “depends  on”  the  value  of  the  independent  variable). 
In  story  problems,  when  one  has  to  translate  from  English  into  mathematics,  a  crucial  step  is  to  determine 
what  letters  stand  for  variables.  If  only  words  and  no  letters  are  given,  then  we  have  to  decide  which  letters 
to  use.  Some  letters  are  traditional.  For  example,  almost  always,  t  stands  for  time. 


Example  2.3:  Open  Box 


An  open-top  box  is  made  from  an  ax  b  rectangular  piece  of  cardboard  by  cutting  out  a  square  of 
side  x  from  each  of  the  four  corners,  and  then  folding  the  sides  up  and  sealing  them  with  duct  tape. 
Find  a  formula  for  the  volume  V  of  the  box  as  a  function  ofx,  and  find  the  domain  of  this  function. 


Solution.  The  box  we  get  will  have  height  x  and  rectangular  base  of  dimensions  a  —  2x  by  b  —  2x.  Thus, 

V  =  f(x)  —  x(a  —  2  x)(b  —  2x) . 

Here  a  and  b  are  constants,  and  V  is  the  variable  that  depends  on  x,  i.e.,  V  is  playing  the  role  of  y. 

This  formula  makes  mathematical  sense  for  any  x,  but  in  the  story  problem  the  domain  is  much  less. 
In  the  first  place,  x  must  be  positive.  In  the  second  place,  it  must  be  less  than  half  the  length  of  either  of 
the  sides  of  the  cardboard.  Thus,  the  domain  is 

|x  E  M  :  0  <  x  <  -(minimum  of  a  and  b) 

In  interval  notation  we  write:  the  domain  is  the  interval  (0,min(a,Z?)/2).  You  might  think  about  whether 
we  could  allow  0  or  (the  minimum  of  a  and  b )  to  be  in  the  domain.  They  make  a  certain  physical  sense, 
though  we  normally  would  not  call  the  result  a  box.  If  we  were  to  allow  these  values,  what  would  the 
corresponding  volumes  be?  Does  that  volume  make  sense?  X 


Example  2.4:  Circle  of  Radius  r  Centered  at  the  Origin 


Is  the  circle  of  radius  r  centered  at  the  origin  the  graph  of  a  function? 


Solution.  The  equation  for  this  circle  is  usually  given  in  the  form  x2  +y2  =  r2.  To  write  the  equation  in 
the  form  y  —  f(x)  we  solve  for  y,  obtaining  y  —  ±  V r2  —  x2 .  But  this  is  not  a  function,  because  when  we 
substitute  a  value  in  the  interval  (— r,  r )  for  x  there  are  two  corresponding  values  of  y.  To  get  a  function,  we 
must  choose  one  of  the  two  signs  in  front  of  the  square  root.  If  we  choose  the  positive  sign,  for  example, 
we  get  the  upper  semicircle  y  —  /(x)  —  y/ r2  —  x2  (see  figure  2.2).  The  domain  of  this  function  is  the 
interval  [— r,r],  i.e.,  x  must  be  between  —  r  and  r  (including  the  endpoints).  If  x  is  outside  of  that  interval, 
then  r 2  —  x2  is  negative,  and  we  cannot  take  the  square  root.  In  terms  of  the  graph,  this  just  means  that 
there  are  no  points  on  the  curve  whose  x-coordinate  is  greater  than  r  or  less  than  —r.  X 
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Figure  2.2:  Upper  semicircle  y  —  \fr2  —  x2. 


Solution.  To  answer  this  question,  we  must  rule  out  the  x-values  that  make  4x  —  x 2  negative  (because  we 
cannot  take  the  square  root  of  a  negative  number)  and  also  the  x-values  that  make  4x  —  x2  zero  (because 
if  4x  —  x2  —  0,  then  when  we  take  the  square  root  we  get  0,  and  we  cannot  divide  by  0).  In  other  words, 
the  domain  consists  of  all  x  for  which  4x  —  x2  is  strictly  positive.  The  inequality  4x  —  x2  >  0  was  solved  in 
Example  1.12.  In  interval  notation,  the  domain  is  the  interval  (0,4).  X 

A  function  does  not  always  have  to  be  given  by  a  single  formula  as  the  next  example  demonstrates. 


Example  2.6:  Piecewise  Velocity 


Suppose  thaty  —  v(t)  is  the  velocity  function  for  a  car  which  starts  out  from  rest  (zero  velocity)  at 
time  t  =  0;  then  increases  its  speed  steadily  to  20  m/sec,  taking  10  seconds  to  do  this;  then  travels 
at  constant  speed  20  m/sec  for  15  seconds;  and  finally  applies  the  brakes  to  decrease  speed  steadily 
to  0,  taking  5  seconds  to  do  this.  The  formula  for  y  —  v(t)  is  different  in  each  of  the  three  time 
intervals:  first  y  —  2x,  then  y  —  20,  then  y  —  —4x-\- 120.  The  graph  of  this  function  is  shown  in 
figure  2.3. 
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Our  focus  in  this  textbook  will  be  on  elementary  functions.  Examples  of  elementary  functions  in¬ 
clude  both  algebraic  functions,  such  as  polynomials,  rational  functions,  and  powers,  and  transcendental 
functions,  such  as  exponential,  logarithmic,  and  trigonometric  functions. 


Exercises  for  2.1 


Exercise  2.1.1  Find  the  domain  of  each  of  the  following  functions: 


(a)  y  =  x“  +  1 

(b)  y  =  f(x)  =  V2x-3 

(c)  y  =  f(x )  =  i/(*+i) 

(d)  y  =  f(x)  =  l/(.r-  1) 

(e)  y  =  f(x )  =  \/-l/x 

(f)  y  =  fix)  =  \[x 


(h)  y  =  f(x)  =  ffx 

(i)  y  =  V 1  —x2 

(j)  y  =  fix)  =  s/i  ~{i/x) 

(k)  y  =  fix)  =  1/ \  —  (3x)2 
(i)  y  =  fix )  =  v^+ 1) 


(g)  y  —  fix)  —  y  r2  —  (x  —  h)2,  where  r  and  h  are 
positive  constants. 


(m)  y  —  ffx)  =  1/iVf-l) 


Exercise  2.1.2  A  farmer  wants  to  build  a  fence  along  a  river.  He  has  500  feet  of  fencing  and  wants  to 
enclose  a  rectangular  pen  on  three  sides  (with  the  river  providing  the  fourth  side).  If  x  is  the  length  of  the 
side  perpendicular  to  the  river,  determine  the  area  of  the  pen  as  a  function  of  x.  What  is  the  domain  of  this 
function  ? 

Exercise  2.1.3  A  can  in  the  shape  of  a  cylinder  is  to  be  made  with  a  total  of  100  square  centimeters 
of  material  in  the  side,  top,  and  bottom;  the  manufacturer  wants  the  can  to  hold  the  maximum  possible 
volume.  Write  the  volume  as  a  function  of  the  radius  r  of  the  can;  find  the  domain  of  the  function. 
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Exercise  2.1.4  A  can  in  the  shape  of  a  cylinder  is  to  be  made  to  hold  a  volume  of  one  liter  (1000  cubic 
centimeters).  The  manufacturer  wants  to  use  the  least  possible  material  for  the  can.  Write  the  surface 
area  of  the  can  (total  of  the  top,  bottom,  and  side)  as  a  function  of  the  radius  r  of  the  can;  find  the  domain 
of  the  function. 


2.2  Transformations  and  Compositions 


2.2.1.  Transformations 


Transformations  are  operations  we  can  apply  to  a  function  in  order  to  obtain  a  new  function.  The  most 
common  transformations  include  translations  (shifts),  stretches  and  reflections.  We  summarize  these  be¬ 
low. 


Function 

Conditions 

How  to  graph  F(x)  given  the  graph  of  f(x) 

F(x)  =f(x)+c 

c  >  0 

Shift  /(x)  upwards  by  c  units 

F(x)  =  f{x)  —  c 

c  >  0 

Shift  /(x)  downwards  by  c  units 

F(x )  =f(x  +  c) 

c  >  0 

Shift  /(x)  to  the  left  by  c  units 

F(x)  —  f{x  —  c) 

c  >  0 

Shift  f(x)  to  the  right  by  c  units 

F(x)  =  -f{x) 

Reflect  f(x)  about  the  x-axis 

F(x)  =  f(-x) 

Reflect  /(x)  about  the  y-axis 

F{x )  =  |/M| 

Take  the  part  of  the  graph  of  /(x)  that  lies 

below  the  x-axis  and  reflect  it  about  the  x-axis 

For  horizontal  and  vertical  stretches,  different  resources  use  different  terminology  and  notation.  Use 
the  one  you  are  most  comfortable  with!  Below,  both  a,b  are  positive  numbers.  Note  that  we  only  use  the 
term  stretch  in  this  case: 


Function 

Conditions 

How  to  graph  F(x)  given  the  graph  of  /(x) 

F(x)  =  af(x) 

a  >  0 

Stretch  /(x)  vertically  by  a  factor  of  a 

F(x)  =  f{bx) 

b>  0 

Stretch  /(x)  horizontally  by  a  factor  of  1  /b 

In  the  next  case,  we  use  both  the  terms  stretch  and  shrink.  We  also  split  up  vertical  stretches  into  two 
cases  (0  <  a  <  1  and  a  >  1),  and  split  up  horizontal  stretches  into  two  cases  (0  <  b  <  1  and  b  >  1).  Note 
that  having  0  <  a  <  1  is  the  same  as  having  1/c  with  c  >  1 .  Also  note  that  stretching  by  a  factor  of  1  / c  is 
the  same  as  shrinking  by  a  factor  c. 


Function 

Conditions 

How  to  graph  F(x)  given  the  graph  of  /(x) 

F{x)  =c/(x) 

c  >  1 

Stretch  /(x)  vertically  by  a  factor  of  c 

Fix)  =  (l/c)/(x) 

c  >  1 

Shrink  /(x)  vertically  by  a  factor  of  c 

F{x)  =f(cx) 

c  >  1 

Shrink  /(x)  horizontally  by  a  factor  of  c 

F(x)  =  f(x/c) 

c  >  1 

Stretch  /(x)  horizontally  by  a  factor  of  c 
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Some  resources  keep  the  condition  0  <  c  <  1  rather  than  using  1/c.  This  is  illustrated  in  the  next  table. 


Function 

Conditions 

How  to  graph  F(x)  given  the  graph  of  f(x) 

F(x) 

=  df(x) 

d>  1 

Stretch  f(x)  vertically  by  a  factor  of  d 

F(x) 

=  df(x) 

0  <  d  <  1 

Shrink  f(x)  vertically  by  a  factor  of  1  /d 

F(x) 

=  f(dx) 

d>  1 

Shrink  f(x)  horizontally  by  a  factor  of  d 

Fix) 

=  f(dx) 

0  <  d  <  1 

Stretch  f(x)  horizontally  by  a  factor  of  1  /d 

Example  2.7:  Transformations  and  Graph  Sketching 


In  this  example  we  will  use  appropriate  transformations  to  sketch  the  graph  of  the  function 

y  =  \Vx  +  2-  1 1  —  1 


Solution.  We  start  with  the  graph  of  a  function  we  know  how  to  sketch,  in  particular,  y  —  y/x:  To  obtain 
the  graph  of  the  function  y  =  yGc  +  2  from  the  graph  y  —  y/x,  we  must  shift  y  —  y/x  to  the  left  by  2  units. 
To  obtain  the  graph  of  the  function  y  —  y/x  +  2  —  1  from  the  graph  y  =  y/x  +  2,  we  must  shift  y  =  \/x  +  2 
downwards  by  1  unit. 


To  obtain  the  graph  of  the  function  y  —  \  y/x  +  2  —  1 1  from  the  graph  y  =  y/x  +  2  —  1 ,  we  must  take  the  part 
of  the  graph  of  y  —  y/x  +  2  —  1  that  lies  below  the  x-axis  and  reflect  it  (upwards)  about  the  x-axis.  Finally, 
to  obtain  the  graph  of  the  function  y  —  \y/x  +  2  —  1 1  —  1  from  the  graph  y  —  \y/x  +  2  —  1|,  we  must  shift 
y  —  |  sJx  +  2  —  1 1  downwards  by  1  unit: 


X 
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2.2.2.  Combining  Two  Functions 


Let  /  and  g  be  two  functions.  Then  we  can  form  new  functions  by  adding,  subtracting,  multiplying,  or 
dividing.  These  new  functions,  f  +  g,f  —  g,fg  and  f/g,  are  defined  in  the  usual  way. 


r  7 

Operations  on  Functions 

if  +  g)(x)=f(x)+g(x)  (f 

(fg)(x)  =  f(x)g(x)  I 

|-k)W  =f(x)-g(x) 

( A  M  - 

{ gJU  g{x) 

Suppose  Df  is  the  domain  of  /  and  Dg  is  the  domain  of  g.  Then  the  domains  of  f  +  g,  f  —  g  and 
fg  are  the  same  and  are  equal  to  the  intersection  I) /■  f)  Dg  (that  is,  everything  that  is  in  common  to  both 
the  domain  of  /  and  the  domain  of  g).  Since  division  by  zero  is  not  allowed ,  the  domain  of  f/g  is 

{xeDfDDg  :g(x)  ^0}. 

Another  way  to  combine  two  functions  /  and  g  together  is  a  procedure  called  composition. 


Function  Composition 


Given  two  functions  /  and  g,  the  composition  of  /  and  g,  denoted  by  /  o  g,  is  defined  as: 

(f°g)(x)  =f(g(x)). 


The  domain  of  fog  is  {x  e  Dg  :  g(jc)  e  Df},  that  is,  it  contains  all  values  jc  in  the  domain  of  g  such 
that  g(x)  is  in  the  domain  of  /. 


Solution.  The  domain  of  f  is  Df  —  jx  e  R}.  The  domain  of  g  is  D„  —  \x  e  K. :  x  >  0}.  The  function 

JFm =/(*«)  - 

f(g(x))  =  (\/x)2  =x. 

Typically,  h(x)  —  x  would  have  a  domain  of  {x  G  M},  but  since  it  came  from  a  composed  function,  we 
must  consider  g(x)  when  looking  at  the  domain  of  f(g(x)).  Thus,  the  domain  of  fo  g  is  (r6l:x>  0}. 

* 


Example  2.9:  Combining  Two  Functions 


Let  f(x)  —x2  +  3  and  g(x)  —x  —  2.  Find  f  +  g,  f  —  g,  fg,  f/g,f°g  and  go  f.  Also,  determine  the 
domains  of  these  new  functions. 
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Solution  For  f  +  g  we  have: 

(f  +  g)  0)  =f(x)+g(x)  =  (x2  +  3)  +  (x-2)  =x2+x+l. 

For  f  —  g  we  have: 

( f-g)(x )  =  f(x)  —  g(x)  —  (x2  +  3)  -  (x  —  2)  =  x2  +  3  -x  +  2  —  x2  -  x  +  5. 
For  fg  we  have: 


(fg) 0)  =  f(x)  ■  g(x)  =  (xz  +  3)(x-2)  =x3  -  2x2  +  3x -  6. 


For  f/g  we  have: 


For  fog  we  have: 


For  go  f  we  have: 


f\  (x)  =  fW=  •y2  +  3 

Ju  g{x)  x  —  2 


(f°g)(x)  =  f(g(x))  =  f(x- 2)  =  (x-2)2  +  3  =x2-4.r  +  7. 


{g°f)(x)  =g(f(x))  =g(x2  +  3)  =  (x2  +  3)  — 2  =  xr  +  l. 

The  domains  of  f  +  g,  f  —  g,  fg,  fog  and  g  o  f  is  {x  G  M},  while  the  domain  of  // g  is  {x  G 

As  in  the  above  problem,  fog  and  g  o  f  are  generally  different  functions. 


:  xf-2}. 

X 


Exercises  for  2.2 


Exercise  2.2.1  Starting  with  the  graph  ofy  —  yfx,  the  graph  of  y  —  1/x,  and  the  graph  of  y  —  \J  \  —  x2 
(the  upper  unit  semicircle),  sketch  the  graph  of  each  of  the  following  functions: 


(a)  f(x)  —  Vx  —  2 

(g)  fix)  =  —4  +  —(x  —  2) 

(b)  f(x)  =  -1  -  l/(x  +  2) 

(h)  f{x)  =  2yJ  1  —  (x/3)2 

(c)  f(x)  —  4  +  Vx  +  2 

(i)  fix)  =  l/(x+  1) 

( d)  y  =  fix)  =x/(l-x) 

(j)  fix)=  4  +  2v/l-(x-5)2/9 

(e)  y  =  f{x )  =  -V^x 

(k)  fix)  =  1  T  l/(x  1) 

(f)  f(x)  =  2+  Jl-{x-  l)2 

(l)  fix)  =  J  100-25(x-  l)2  +  2 

Exercise  2.2.2  The  graph  of  f(x)  is  shown  below.  Sketch  the  graphs  of  the  following  functions. 
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(a)  y  =  f{x-  1) 

(b)  y  =  1  +f(x  +  2) 

(c)  y  =  1+2 f{x) 

( d)  y  =  2f(3x) 


(f)  y  —  (l/2)/(3x  —  3) 

(g)  y  =  f(  l+Jc/3)  +  2 

(h)  y=  |/(x)  -2| 


Exercise  2.2.3  Suppose  f(x)  —  3x  —  9  and  g(x)  —  yfx.  What  is  the  domain  of  the  composition  (g  o  /)  (x)  ? 

2.3  Exponential  Functions 


An  exponential  function  is  a  function  of  the  form  f(x)  —  ax,  where  a  is  a  constant.  Examples  are  2+  1 (T 
and  (1/2)+  To  more  formally  define  the  exponential  function  we  look  at  various  kinds  of  input  values. 

It  is  obvious  that  a5  =  aaaaa  and  a 3  —  a-  a-  a,  but  when  we  consider  an  exponential  function  ax 
we  can’t  be  limited  to  substituting  integers  for  x.  What  does  a2  5  or  a~l 3  or  aK  mean?  And  is  it  really  true 
that  a25a~ 13  =  a2  5-13?  The  answer  to  the  first  question  is  actually  quite  difficult,  so  we  will  evade  it;  the 
answer  to  the  second  question  is  “yes.” 

We’ll  evade  the  full  answer  to  the  hard  question,  but  we  have  to  know  something  about  exponential 
functions.  You  need  first  to  understand  that  since  it’s  not  “obvious”  what  2X  should  mean,  we  are  really 
free  to  make  it  mean  whatever  we  want,  so  long  as  we  keep  the  behavior  that  is  obvious,  namely,  when  x 
is  a  positive  integer.  What  else  do  we  want  to  be  true  about  2X2  We  want  the  properties  of  the  previous 
two  paragraphs  to  be  true  for  all  exponents:  2x2y  =  2x+y  and  ( 2x)y  =  2xy. 

After  the  positive  integers,  the  next  easiest  number  to  understand  is  0:  2°  =  1.  You  have  presumably 
learned  this  fact  in  the  past;  why  is  it  true?  It  is  true  precisely  because  we  want  2a2b  —  2a+b  to  be  true 
about  the  function  2+  We  need  it  to  be  true  that  2°2X  =  2°  fx  =  2X,  and  this  only  works  if  2°  =  1.  The  same 
argument  implies  that  a0  —  l  for  any  a. 

The  next  easiest  set  of  numbers  to  understand  is  the  negative  integers:  for  example,  2~3  =  1/23.  We 

_ O  _ O  Q  _ Q  i  O  r\  _ Q 

know  that  whatever  2  means  it  must  be  that  2  2—2  —2  =  1,  which  means  that  2  must  be 

1/23.  In  fact,  by  the  same  argument,  once  we  know  what  2X  means  for  some  value  of  x,  2~x  must  be  l/2x 
and  more  generally  a  ^  =  l/ax. 

Next,  consider  an  exponent  1  / q,  where  q  is  a  positive  integer.  We  want  it  to  be  true  that  (2x)y  —  2xy, 
so  ( 2^q)q  =  2.  This  means  that  2l^q  is  a  q- th  root  of  2,  2}^q  —  f/2  .  This  is  all  we  need  to  understand  that 

2p/q  =  (2  V«)P  =  (f[2)P  and  aP^  =  ( al/q)P  =  {^a)p. 


2.3.  Exponential  Functions  55 


What’s  left  is  the  hard  part:  what  does  2A  mean  when  x  cannot  be  written  as  a  fraction,  like  x  —  \/~l  or 
x—nl  What  we  know  so  far  is  how  to  assign  meaning  to  2X  whenever  x  —  p/q.  If  we  were  to  graph  ax 
(for  some  a  >  1)  at  points  x  —  p/q  then  we'd  see  something  like  this: 

y+ 


fr-r-yyr  i  i  i — i  i  i  i  i 


This  is  a  poor  picture,  but  it  illustrates  a  series  of  individual  points  above  the  rational  numbers  on  the 
v-axis.  There  are  really  a  lot  of  “holes”  in  the  curve,  above  x  —  K,  for  example.  But  (this  is  the  hard  part) 
it  is  possible  to  prove  that  the  holes  can  be  “filled  in”,  and  that  the  resulting  function,  called  ax ,  really  does 
have  the  properties  we  want,  namely  that  axay  =  ax+y  and  (ax)y  —  axy .  Such  a  graph  would  then  look  like 
this: 


y- 

[I 

— 1 — 1 — 1 — 1 — 1— 

X 

Three  Types  of  Exponential  Functions 

There  are  three  kii 
0  <  a  <  1: 

y- 

nds  of  exponential  functions  / 

^  y  ■ 

(x)  —  ax  depending  on  whethe: 

r  a  >  1,  a  =  1  or 

1  1  1  1  1  1  1  l"l 

/(t>  : 

a  > 

1  1  1  1  1  1  1  1  1  - 1  1  1  1  1  1  1  1  1 

X 

f(x) 

■  1 

1  1  1  1  1  1  1  1  1  - 1  1  1  1  1  1  1  1  1 

X 

=  1*  /(*)  : 

0  <  a 

1  1  1  1  1  1  1  1  1 

X 

=  ax 

<  i 
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Properties  of  Exponential  Functions 


The  first  thing  to  note  is  that  if  a  <  0  then  problems  can  occur.  Observe  that  if  a  —  —  1  then  (  —  l)x  is  not 
defined  for  every  x.  For  example,  x  —  1/2  is  a  square  root  and  gives  (  —  1) 1//2  =  \/— 1  which  is  not  a  real 
number. 


The  last  property  can  be  observed  from  the  graph.  If  a  >  1,  then  as  x  gets  larger  and  larger,  so  does  ax. 
On  the  other  hand,  as  x  gets  large  and  negative,  the  function  approaches  the  v-axis,  that  is,  ax  approaches 


0. 


Example  2.10:  Reflection  of  Exponential 


Determine  an  equation  of  the  function  after  reflecting  y  —  2X  about  the  line  x  —  —2. 


Solution.  First  reflect  about  the  y-axis  to  get  y  —  2~x.  Now  shift  by  2  x  2  =  4  units  to  the  left  to  get 
y  =  2~(x+4K  Side  note:  Can  you  see  why  this  sequence  of  transformations  is  the  same  as  reflection  in  the 
line  x  —  —2?  Can  you  come  up  with  a  general  rule  for  these  types  of  reflections?  £ 


Example  2.11:  Determine  the  Exponential  Function 


Determine  the  exponential  function  f(x)  —  kax  that  passes  through  the  points  (1,6)  and  (2, 18). 


Solution.  We  substitute  our  two  points  into  the  equation  to  get: 

x  —  l,y  =  6  — *  6  =  ka1 
x  =  2,y  =  18  — >-  18  =  ka2 

This  gives  us  6  —  ka  and  18  =  ka2.  The  first  equation  is  k  —  6/a  and  subbing  this  into  the  second  gives: 
18  =  (6/ a)a2.  Thus,  18  =  6 a  and  a  —  3.  Now  we  can  see  from  6  =  ka  that  k  =  2.  Therefore,  the  exponential 
function  is 


/«=  2-3T 
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X 

There  is  one  base  that  is  so  important  and  convenient  that  we  give  it  a  special  symbol.  This  number 
is  denoted  by  e  —  2.71828. . .  (and  is  an  irrational  number).  Its  importance  stems  from  the  fact  that  it 
simplifies  many  formulas  of  Calculus  and  also  shows  up  in  other  fields  of  mathematics. 


Solution.  For  domain,  we  cannot  divide  by  zero  or  take  the  square  root  of  negative  numbers.  Note  that 
one  of  the  properties  of  exponentials  is  that  they  are  always  positive!  Thus,  eK  +  1  >  0  (in  fact,  as  e*  >  0 
we  actually  have  that  ex  +  1  is  at  least  one).  Therefore,  ex  +  1  is  never  zero  nor  negative,  and  gives  no 
restrictions  on  x.  Thus,  the  domain  is  R.  X 


Exercises  for  2.3 


Exercise  2.3.1 
Exercise  2.3.2 
Exercise  2.3.3 

Exercise  2.3.4 


Determine  an  equation  of  the  function  y 
Find  the  y-intercept  of  fix)  —  4X  +  6. 
Find  the  y-intercept  of  f{x)  —  2  Q)*. 

i 

Find  the  domain  ofy  —  e  +  e*. 


ax  passing  through  the  point  (3,8). 


2.4  Inverse  Functions 


In  mathematics,  an  inverse  is  a  function  that  serves  to  “undo”  another  function.  That  is,  if  f{x)  produces 
y,  then  putting  y  into  the  inverse  of  /  produces  the  output  x.  A  function  /  that  has  an  inverse  is  called 
invertible  and  the  inverse  is  denoted  by  /-1.  It  is  best  to  illustrate  inverses  using  an  arrow  diagram: 
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Notice  how  /  maps  1  to  a,  and  /  1  undoes  this,  that  is,  /  1  maps  a  back  to  1.  Don’t  confuse  /  1  (x)  with 
exponentiation:  the  inverse  f~l  is  different  from  -y— . 

Not  every  function  has  an  inverse.  It  is  easy  to  see  that  if  a  function  /(x)  is  going  to  have  an  inverse, 
then  /(x)  never  takes  on  the  same  value  twice.  We  give  this  property  a  special  name. 

A  function  /(x)  is  called  one-to-one  if  every  element  of  the  range  corresponds  to  exactly  one  element 
of  the  domain.  Similar  to  the  Vertical  Line  Test  (VLT)  for  functions,  we  have  the  Horizontal  Line  Test 
(HLT)  for  the  one-to-one  property. 


Theorem  2.13:  The  Horizontal  Line  Test 


A  function  is  one-to-one  if  and  only  if  there  is  no  horizontal  line  that  intersects  its  graph  more  than 
once. 


Example  2.14:  Parabola  is  Not  One-to-one 


The  parabola  /(x)  =  x2  it  not  one-to-one  because  it  does  not  satisfy  the  horizontal  line  test.  For 
example,  the  horizontal  line  y  —  1  intersects  the  parabola  at  two  points,  when  x  =  -  1  and  x  =  1. 


We  now  formally  define  the  inverse  of  a  function. 


Thus,  if  /  maps  x  to  y,  then  /“ 1  maps  y  back  to  x.  This  gives  rise  to  the  cancellation  formulas: 

f  l{f(,x))  —  V  for  every  x  in  the  domain  of  /(x), 

/(/“ 1  (x) )  =  x,  for  every  x  in  the  domain  of  /~ 1  (x) . 


Solution.  Rather  than  trying  to  compute  a  formula  for  /  1  and  then  computing  /-1(5),  we  can  sim¬ 
ply  find  a  number  c  such  that  /  evaluated  at  c  gives  5.  Note  that  subbing  in  some  simple  values  (x  = 
—3,— 2, 1,0, 1,2,3)  and  evaluating /(x)  we  eventually  find  that /(l)  =  19  +  2(17)  + 1  +  1  =  5  and/(0)  =  1. 
Therefore,  /_1(5)  =  1  and/_1(l)  =0. 

To  compute  the  equation  of  the  inverse  of  a  function  we  use  the  following  guidelines. 
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Solution.  Starting  with  y  —  2x3  +  1  we  solve  for  x  as  follows: 


y-l  =  2x3 


->■ 


x  — 


Therefore,  /  1  (x)  =  * 

This  example  shows  how  to  find  the  inverse  of  a  function  algebraically.  But  what  about  finding  the 
inverse  of  a  function  graphically ?  Step  3  (switching  x  and  y)  gives  us  a  good  graphical  technique  to  find 
the  inverse,  namely,  for  each  point  (a,b)  where  f(a)  =  b,  sketch  the  point  (b,a)  for  the  inverse.  More 
formally,  to  obtain  /_1  (x)  reflect  the  graph  f{x)  about  the  line  y  —  x. 

y  =  x 

/ 

/ 


f 


Exercises  for  2.4 


\x\  one-to-one? 


Exercise  2.4.1  Is  the  function  f(x) 
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Exercise  2.4.2  Ifh(x)  —  ex  +x+  l,  find  h  1(2). 

Exercise  2.4.3  Find  a  formula  for  die  inverse  of  the  function  f{x)  — 


2.5  Logarithms 


Recall  the  three  kinds  of  exponential  functions  f(x)  =  ax  depending  on  whether  0  <a<  l,a  =  1  or  a  >  I : 


a  >  1 


0  <  a  <  1 


So  long  as  a  ^  1,  the  function  fix)  =  ax  satisfies  the  horizontal  line  test  and  therefore  has  an  inverse.  We 
call  the  inverse  of  cd  the  logarithmic  function  with  base  a  and  denote  it  by  logfl.  In  particular, 

log ax  —  y  •<=>■  ay—x. 


The  cancellation  formulas  for  logs  are: 

logf;  (ax)  —  x,  for  every  x  G  M, 
a]og“ix>  —  x,  for  every  x  >  0. 

Since  the  function  f(x)  —  ax  for  a  f  1  has  domain  R  and  range  (0,°°),  the  logarithmic  function  has  domain 
(0,°o)  and  range  R.  For  the  most  part,  we  only  focus  on  logarithms  with  a  base  larger  than  1  (i.e.,  a  >  1) 
as  these  are  the  most  important. 
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Notice  that  every  logarithm  passes  through  the  point  ( 1 , 0)  in  the  same  way  that  every  exponential  function 
passes  through  the  point  (0, 1). 

Some  properties  of  logarithms  are  as  follows. 


The  Natural  Logarithm 


As  mentioned  earlier  for  exponential  functions,  the  number  e  ~  2.71828 ...  is  the  most  convenient  base  to 
use  in  Calculus.  For  this  reason  we  give  the  logarithm  with  base  e  a  special  name:  the  natural  logarithm. 
We  also  give  it  special  notation: 

\ogex  —  lnx. 

You  may  pronounce  In  as  either:  “el  -  en”,  “lawn”,  or  refer  to  it  as  “natural  log”.  The  above  properties 
of  logarithms  also  apply  to  the  natural  logarithm. 

Often  we  need  to  turn  a  logarithm  (in  a  different  base)  into  a  natural  logarithm.  This  gives  rise  to  the 
change  of  base  formula. 


Example  2.19:  Combine  Logarithms 


Write  In  A  +  21n  B  —  InC  as  a  single  logarithm. 
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Solution.  Using  properties  of  logarithms,  we  have, 


lnA  +  21ni?  — InC 


InA  +  lnfi2  —  InC 

ln(A52)  —  InC 

AB2 

In - 

C 


X 


Example  2.20:  Solve  Exponential  Equations  using  Logarithms 


If  ex+2  =  6e2x,  then  solve  for  x. 


Solution.  Taking  the  natural  logarithm  of  both  sides  and  noting  the  cancellation  formulas  (along  with 
lne  —  1),  we  have: 

<A+2  =  6e2x 
lneJf+2  =  ln(6e2x) 
x  +  2  —  ln6  +  lne2x 
x  +  2  =  In  6  +  2x 
x  =  2  —  In  6 

X 


Solution.  “Taking  e"  of  both  sides  and  noting  the  cancellation  formulas,  we  have: 

^ln(2.v— 1)  _  g21n(.r) 

(2x—  1)  =  eln(*2) 

2x—  1  =  x2 

x2  —  2x  +  1  =  0 

(x-1)2  =  0 


Therefore,  the  solution  is  x  —  1 . 


X 
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Exercises  for  2.5 


Exercise  2.5.1 

Exercise  2.5.2 

Exercise  2.5.3 
Exercise  2.5.4 
Exercise  2.5.5 
Exercise  2.5.6 


Expand  log10((x  +  45)7(x  —  2)). 
Expand  log.  —  —  (7/;r) ' 


Write  log2  3x  +  171og2(x  —  2)  —  21og2(x2  +  4x+  1 )  as  a  single  logarithm. 
Solve  log2(l  +  yfx)  =  6  for  x. 

Solve  2X  =8  for  x. 

Solve  log2(log3(x))  =  1  for  x. 


2.6  Inverse  Trigonometric  Functions 


The  trigonometric  functions  frequently  arise  in  problems,  and  often  it  is  necessary  to  invert  the  functions, 
for  example,  to  find  an  angle  with  a  specified  sine.  Of  course,  there  are  many  angles  with  the  same  sine, 
so  the  sine  function  doesn’t  actually  have  an  inverse  that  reliably  “undoes”  the  sine  function.  If  you  know 
that  sin  A'  =  0.5,  you  can’t  reverse  this  to  discover  x,  that  is,  you  can’t  solve  for  x,  as  there  are  infinitely 
many  angles  with  sine  0.5.  Nevertheless,  it  is  useful  to  have  something  like  an  inverse  to  the  sine,  however 
imperfect.  The  usual  approach  is  to  pick  out  some  collection  of  angles  that  produce  all  possible  values  of 
the  sine  exactly  once.  If  we  “discard”  all  other  angles,  the  resulting  function  does  have  a  proper  inverse. 

The  sine  takes  on  all  values  between  —1  and  1  exactly  once  on  the  interval  [—n/2,n/2]. 


If  we  truncate  the  sine,  keeping  only  the  interval  [ — tt/2, 7t/2],  then  this  truncated  sine  has  an  inverse 
function.  We  call  this  the  inverse  sine  or  the  arcsine,  and  write  it  in  one  of  two  common  notation:  y  — 
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arcsin(x) ,  or  y  =  sin  1  (x) . 


Recall  that  a  function  and  its  inverse  undo  each  other  in  either  order,  for  example,  ( \/x) 3  =  x  and 
=  x.  This  does  not  work  with  the  sine  and  the  “inverse  sine”  because  the  inverse  sine  is  the  inverse 
of  the  truncated  sine  function,  not  the  real  sine  function.  It  is  true  that  sin  (arc  sin  (x))  =  x,  that  is,  the 
sine  undoes  the  arcsine.  It  is  not  true  that  the  arcsine  undoes  the  sine,  for  example,  sin (57r/6)  =  1/2  and 
arcsin(l/2)  —  n/ 6,  so  doing  first  the  sine  then  the  arcsine  does  not  get  us  back  where  we  started.  This  is 
because  5tt/6  is  not  in  the  domain  of  the  truncated  sine.  If  we  start  with  an  angle  between  —n/2  and  k/2 
then  the  arcsine  does  reverse  the  sine:  sin(7r/6)  =  1/2  and  arcsin(l/2)  —  k/6. 


Solution.  These  come  directly  from  the  graph  of  y  =  arcsinx: 

sin_1(0)  =  0  sin_1(l)  =  ^  sin_1(  —  1)  =  —  ^ 


* 

We  can  do  something  similar  for  the  cosine  function.  As  with  the  sine,  we  must  first  truncate  the  cosine 
so  that  it  can  be  inverted,  in  particular,  we  use  the  interval  [0,  k]  . 
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Note  that  the  truncated  cosine  uses  a  different  interval  than  the  truncated  sine,  so  that  if  y  —  arccos(x)  we 
know  that  0  <  y  <  K. 


Solution.  These  come  directly  from  the  graph  of  y  —  arccosx: 

cos_1(0)  =  -^  cos_1(l)=0  cos_ 1  ( — 1)  =  71 


Finally  we  look  at  the  tangent;  the  other  trigonometric  functions  also  have  “partial  inverses”  but 
the  sine,  cosine  and  tangent  are  enough  for  most  purposes.  The  truncated  tangent  uses  an  interval  of 

(—k/2,k/2). 
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Reflecting  the  truncated  tangent  in  the  line  y  —  *  gives  the  arctangent  function. 


Example  2.24:  Arctangent  of  Common  Values 


Compute  tan  1  (0) .  What  value  does  tan  1  *  approach  as  x  gets  larger  and  larger?  What  value  does 
tan” 1  v  approach  as  x  gets  large  (and  negative)? 


Solution.  These  come  directly  from  the  graph  of  y  —  arctan*.  In  particular,  tan”1  (0)  =  0.  As  *  gets  larger 
and  larger,  tan”1*  approaches  a  value  of  |,  whereas,  as  x  gets  large  but  negative,  tan”1*  approaches  a 
value  of— | .  X 

The  cancellation  rules  are  tricky  since  we  restricted  the  domains  of  the  trigonometric  functions  in  order 
to  obtain  inverse  trig  functions: 


Solution  Since  sin_1(*)  outputs  values  in  [ — tt/ 2,  k/ 2],  the  answer  must  be  in  this  interval.  Let  0  — 
sin”1  (1/2).  We  need  to  compute  0.  Take  the  sine  of  both  sides  to  get  sin0  =  sin  (sin”1  (1/2))  =  1/2  by 
the  cancellation  rule.  There  are  many  angles  6  that  work,  but  we  want  the  one  in  the  interval  [— tt/2,  n/2]. 

Thus,  6  =  k/ 6  and  hence,  sin~ 


K 

6' 
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Solution.  Since  cos  !(x)  outputs  values  in  [0,  n],  the  answers  must  be  in  this  interval.  The  first  two  we 
can  cancel  using  the  cancellation  rules: 

cos_1(cos(0))  =  0  and  cos-1  (cos(7r))  =  K. 

The  third  one  we  cannot  cancel  since  2 n  £  [0,  n\ : 

cos_1(cos(27r))  is  NOT  equal  to  2%. 

But  we  know  that  cosine  is  a  27T-periodic  function,  so  cos(27t)  =  cos(0): 

cos-1(cos(27t))  =  cos_1(cos(0))  =  0 

Similarly  with  the  fourth  one,  we  can  NOT  cancel  yet  since  3 K  £  [0,  k\  .  Using  cos(37t)  =  cos(37T  —  2k)  — 
cos(7r): 

cos_1(cos(37r))  =  cos_1(cos(7r))  =  K. 

X 


Example  2.27:  The  Triangle  Technique 


Rewrite  the  expression  cos(sin  1  x)  without  trig  functions.  Note  that  the  domain  of  this  function  is 
allx  G  [—1, 1]. 


Solution.  Let  0  —  sin  1  x.  We  need  to  compute  cos0.  Taking  the  sine  of  both  sides  gives  sin0  = 
s  i  n  ( s  i  n  1  (.*))  —  xhy  the  cancellation  rule.  We  then  draw  a  right  triangle  using  sin  0  =xj  1 : 


If  z  is  the  remaining  side,  then  by  the  Pythagorean  Theorem: 

z2+x2  —  1  — >  z2  —  l—x2  — »  z  —  ±  V 1  —  X2 

and  hence  z  —  +\/l  —  x2  since  9  G  [—n/2,  k/2].  Thus,  cos0  =  \/l  —  x2  by  SOH  CAH  TOA,  so, 
cos(sin~1A:)  =  \J  1  —x2.  X 
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Example  2.28:  The  Triangle  Technique  2 


For  x  E  (0,1),  rewrite  the  expression  sin(2  cos  1  x) .  Compute  sin(2  cos  1  ( 1/2) ) . 


Solution.  Let  0  =  cos  1  x  so  that  cos  0  —  x.  The  question  now  asks  for  us  to  compute  sin(20).  We  then 
draw  a  right  triangle  using  cos  0  =  jc/1  : 


/l  —  X2 


To  find  sin(20)  we  use  the  double  angle  formula  sin(20)  =  2sin0cos0.  But  sin0  =  \J\  —  x1,  for  0  e 
[0,  tt] ,  and  cos  0  =  x.  Therefore,  sin(2cos_1x)  =  2x\/ 1  —  x2.  When.r=  1/2  we  have  sin(2cos_1(l/2))  = 


Exercises  for  2.6 


Exercise  2.6.1  Compute  the  following: 

(a)  sin_1(v/3/2)  (b)  cos-1  (— y/l/2) 

Exercise  2.6.2  Compute  the  following: 

(a)  sin-1  (sin(7r/4))  (c)  cos  (cos_1(l/3)) 

(b)  sin-1  (sin(17^/3))  (d)  tan  (cos-1  (—4/5)) 

Exercise  2.6.3  Rewrite  the  expression  tan  (cos  - 1  xj  without  trigonometric  functions.  What  is  the  domain 
of  this  function? 


2.7  Hyperbolic  Functions 


The  hyperbolic  functions  appear  with  some  frequency  in  applications,  and  are  quite  similar  in  many  re¬ 
spects  to  the  trigonometric  functions.  This  is  a  bit  surprising  given  our  initial  definitions. 
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Notice  that  cosh  is  even  (that  is,  cosh(— x)  =  cosh(x))  while  sinh  is  odd  (sinh(— x)  =  —  sinh(x)),  and 
coshx  +  sinhx  =  e* .  Also,  for  all  x,  coshx  >  0,  while  sinhx  =  0  if  and  only  if  ex  —  e~x  —  0,  which  is  true 
precisely  when  x  =  0. 


Proof.  Let  y  —  coshx.  We  solve  for  x: 

gx  _|_  e-x 

y  =  - 2“ 

2y  =  + 

2yex  =  e2x  +  l 

0  =  e2x-2yex+l 
X  _  2y±\J  4y2  —  4 

e  2 _ 

ex  =  y  ±  \/y2  -  1 

r\ 

From  the  last  equation,  we  see  y  >  1,  and  since  y  >  0,  it  follows  that  y  >  1 . 

Now  suppose  y  >  1,  so  y  ±  \/y2  —  1  >  0.  Then  x  =  ln(y  =t  \fy2  —  1)  is  a  real  number,  and  y  =  coshx, 
so  y  is  in  the  range  of  cosh(x) .  X 


Definition  2.31:  Hyperbolic  Functions 


We  can  also  define  hyperbolic  functions  for  the  other  trigonometric  functions  as  you  would  expect: 


,  sinhx  ,1  ,1  ,1 

tanhx  = -  cschx  = -  sechx  = -  cothx  = - 

coshx  sinhx  coshx  tanhx 
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The  graph  of  sinhx  is  shown  below: 


The  graph  of  coshx  is  shown  below: 


Solution.  This  uses  the  definitions  of  the  hyperbolic  functions. 

gln2  —  g”ln2  2- (1/2) 

tanh(ln2l  sinh(ln2)  2  2~(1/2)  3 

1  J  cosh(ln2)  eln2  +  e~ln2  2+ (1/2)  2  +  (l/2)  5 

2  2 

X 

Certainly  the  hyperbolic  functions  do  not  closely  resemble  the  trigonometric  functions  graphically. 
But  they  do  have  analogous  properties,  beginning  with  the  following  identity. 
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Proof.  The  proof  is  a  straightforward  computation: 

-x\2  ( „—x\2  „2x 

cosh  x  —  sinh  x  = 


(ex  +  e~x)2  {ex  -  e~x  f  _e2x  +  2  +  e~2x  -  e 2x  +  2  -  e"2*  _  4  _  1 

4  4  =  4  =  4  ~  ' 


This  immediately  gives  two  additional  identities: 

9  9  9  9 

1  —  tanh  x  =  sech  x  and  cotfr  x  —  1  =  csch  x. 


The  identity  of  the  theorem  also  helps  to  provide  a  geometric  motivation.  Recall  that  the  graph  of 
x2  —  v2  =  1  is  a  hyperbola  with  asymptotes  x  =  ±y  whose  x-intercepts  are  ±1.  If  (x,y)  is  a  point  on  the 

right  half  of  the  hyperbola,  and  if  we  let  x  =  coshf,  then  y  —  ± \  A2  —  1  =  ±\/ cosh2x—  1  =  ±  sinht.  So 
for  some  suitable  t,  cosh  /  and  sinh/  are  the  coordinates  of  a  typical  point  on  the  hyperbola.  In  fact,  it  turns 
out  that  t  is  twice  the  area  shown  in  the  first  graph  of  figure  2.4.  Even  this  is  analogous  to  trigonometry; 
cost  and  sin/  are  the  coordinates  of  a  typical  point  on  the  unit  circle,  and  t  is  twice  the  area  shown  in  the 
second  graph  of  Figure  2.4. 


2  2  i 

x  —  y  =  1 


Figure  2.4:  Geometric  definitions.  Here,  t  is  twice  the  shaded  area  in  each  figure. 

Since  coshx  >  0,  sinhx  is  increasing  and  hence  one-to-one,  so  sinhx  has  an  inverse,  arcsinhx.  Also, 
sinhx  >  0  when  x  >  0,  so  coshx  is  injective  on  [0,°°)  and  has  a  (partial)  inverse,  arccoshx.  The  other 
hyperbolic  functions  have  inverses  as  well,  though  arcsechx  is  only  a  partial  inverse. 


Exercises  for  2.7 


Exercise  2.7.1  Show  that  the  range  of  sinhx  is  all  real  numbers.  (Hint:  show >  that  if  y  —  sinhx  then 
x  =  ln(y+  \/y2  +  1)9 
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Exercise  2.7.2  Show  that  the  range  of  tanhx  is  (—1,1).  What  are  the  ranges  o/coth,  sech,  and  csch?  (Use 
the  fact  that  they  are  reciprocal  functions.) 

Exercise  2.7.3  Prove  that  for  every  x,y  G  E,  sinh(x  +  _y)  =  sinhxcosh  y  +  coshxsinhv.  Obtain  a  similar 
identity  for  sinh(x  —  _y). 

Exercise  2.7.4  Prove  that  for  every  x,y  G  M,  cosh(x  +  _y)  =  c  o  s  h  x  c  o  s  h  y  +  sinh  x  sinh  v.  Obtain  a  similar 
identity  for  cosh(x  —  y). 

Exercise  2.7.5  Show  that  sinh(2x)  =  2sinhxcoshx  and  cosh(2x)  =  cosh2  x  + sinh2  x/or  every  x.  Conclude 
also  that  (cosh(2x)  —  l)/2  =  sinh2x. 

Exercise  2.7.6  What  are  the  domains  of  the  six  inverse  hyperbolic  functions? 

Exercise  2.7.7  Sketch  the  graphs  of  all  six  inverse  hyperbolic  functions. 


2.8  Additional  Exercises 


Exercise  2.8.1  Iff(x) 


- then  which  of  the  following  is  equal  to  f 

x  —  1 


? 


(a)  /(x) 

(b)  -f(x) 

(c)  x/(x) 


( d)  -xf(x) 
fix 


fix 


x  fix')  f(  2^1 

Exercise  2.8.2  If  fix)  — - then  find  and  simplify - . 

x+3  x— 2 

/*/  ^  j  c  /  ^  \ 

Exercise  2.8.3  If  f{x)  =  x2,  then  find  and  simplify - - - 

h 

Exercise  2.8.4  What  is  the  domain  of 

sJx-2  9 
x2  —  9 


(a)  fix ) 


2.8.  Additional  Exercises  73 


(b)  g(x) 


s/x-2  ^ 
x2  —  9 


f 

Exercise  2.8.5  Suppose  that  f(x)  =  x5  and  g(x)  =  x.  What  is  the  domain  of  —  ? 

§ 

Exercise  2.8.6  Suppose  that  fix)  =  3x  —  4.  Find  a  function  g  such  that  (g  o  /')  (.x)  =  5x  +  2. 


Exercise  2.8.7  Which  of  the  following  functions  is  one-to-one? 

(a)  f(x)—x2  +  4x  +  3 

(b)  g{x)  =  \x\+2 

(c)  h(x)  —  \/x+  1 

(d)  F(x)  —  cosx,  —  K  <x<K 

(e)  G(x)=ex  +  e-x 


Exercise  2.8.8  What  is  the  inverse  off(x)  =  In 


?  What  is  the  domain  of  f  1  ? 


Exercise  2.8.9  Solve  the  following  equations. 

(a)  e2~x  =  3 

(b)  e-*2  = 

(c)  ln(l  +  y/x)  =2 

(d)  ln(x2  —  3)  =  ln2  +  lnx 


Exercise  2.8.10  Find  the  exact  value  of  sin 

Exercise  2.8.11  Find  sin-1  (sin( 23n/5)). 
Exercise  2.8.12  It  can  be  proved  that  fix) 

Exercise  2.8.13  Sketch  the  graph  of  f(x)  = 


x 3  +x  +  ex  1  is  one-to-one.  What  is  the  value  off  1  (3)  ? 

—x  ifx  <  0 
tan_1x  ifx>  0 


3.  Limits 


3.1  The  Limit 


The  value  a  function  /  approaches  as  its  input  x  approaches  some  value  is  said  to  be  the  limit  of  /.  Limits 
are  essential  to  the  study  of  calculus  and,  as  we  will  see,  are  used  in  defining  continuity,  derivatives,  and 
integrals. 

Consider  the  function 


/(*)  = 


x2-! 
x—  1 


Notice  that  x  =  1  does  not  belong  to  the  domain  of  /(x).  Regardless,  we  would  like  to  know  how  /(x) 
behaves  close  to  the  point  x  =  1.  We  start  with  a  table  of  values: 


X 

/(*) 

0.5 

1.5 

0.9 

1.9 

0.99 

1.99 

1.01 

2.01 

1.1 

2.1 

1.5 

2.5 

It  appears  that  for  values  of  x  close  to  1  we  have  that  /(x)  is  close  to  2.  In  fact,  we  can  make  the  values 
of  /(x)  as  close  to  2  as  we  like  by  taking  x  sufficiently  close  to  1.  We  express  this  by  saying  the  limit  of 
the  function  /(x)  as  x  approaches  1  is  equal  to  2  and  use  the  notation: 

lim  f(x)  =  2. 

x — y  1 


We  read  the  expression  limx_>.a/(x)  —  L  as  “the  limit  of  f(x)  as  x  approaches  a  is  equal  to  L".  When 
evaluating  a  limit,  you  are  essentially  answering  the  following  question:  What  number  does  the  function 
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approach  while  x  gets  closer  and  closer  to  a  (but  not  equal  to  a )?  The  phrase  but  not  equal  to  a  in  the 
definition  of  a  limit  means  that  when  finding  the  limit  of  /(x)  as  x  approaches  a  we  never  actually  consider 
x  =  a.  In  fact,  as  we  just  saw  in  the  example  above,  a  may  not  even  belong  to  the  domain  of  /.  All  that 
matters  for  limits  is  what  happens  to  /  close  to  a,  not  necessarily  what  happens  to  /  at  a. 

One-sided  limits 


Observe  from  the  graph  that  as  x  gets  closer  and  closer  to  1  from  the  left,  then  /(x)  approaches  +1. 
Similarly,  as  x  gets  closer  and  closer  1  from  the  right,  then  /(x)  approaches  +2.  We  use  the  following 
notation  to  indicate  this: 

lim  /(x)  =  1  and  lim  f(x)  —  2. 

X — ^1  X — ^l-^- 

The  symbol  x  — *  1“  means  that  we  only  consider  values  of  x  sufficiently  close  to  1  which  are  less  than  1. 
Similarly,  the  symbol  x  — *  1+  means  that  we  only  consider  values  of  x  sufficiently  close  to  1  which  are 
greater  than  1 . 


We  note  the  following  fact: 


lim/(x)  —  L  if  and  only  if 

x— >a 


lim  /(x)  =  L 

x — ya~ 


and 


lim  /(x)  =  L. 

x — ya^ 
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Or  more  concisely: 

lim  /(x)  —  lim  f(x)  —  L. 

x—>a~~  x— >a+ 

A  consequence  of  this  fact  is  that  if  the  one-sided  limits  are  different ,  then  the  two-sided  limit  lim  fix) 

x — fa 

does  not  exist,  often  denoted  as:  (DNE). 


Exercises  for  Section  3.1 


sinx 

Exercise  3.1.1  Use  a  calculator  to  estimate  lim - ,  where  x  is  in  radians. 

x — ^0  X 


tan(3x) 

Exercise  3.1.2  Use  a  calculator  to  estimate  lim  — — — ,  where  x  is  in  radians. 

-v— >o  tan(5x) 

be  —  1 1  .  Ijc  —  1 1 

Exercise  3.1.3  Use  a  calculator  to  estimate  lim  - T  and  lim  - T. 

x^i+l—  x~  *->i-  1  —  xz 


3.2  Precise  Definition  of  a  Limit 


The  definiton  given  for  a  limit  previously  is  more  of  a  working  definition.  In  this  section  we  pursue  the 
actual,  official  definition  of  a  limit. 


Definition  3.3:  Precise  Definition  of  Limit 


Suppose  f  is  a  function.  We  say  that  lim  fix)  =  L  if  for  every  e  >  0  there  is  a  8  >  0  so  that  whenever 

X-^fCL 

0  <  \x  —  a\  <  8,  |  f(x)  —L\  <  £. 


The  £  and  8  here  play  exactly  the  role  they  did  in  the  preceding  discussion.  The  definition  says,  in  a 
very  precise  way,  that  /(x)  can  be  made  as  close  as  desired  to  L  (that’s  the  |/(x)  —  L\  <  £  part)  by  making 
x  close  enough  to  a  (the  0  <  \x  —  a\  <  8  part).  Note  that  we  specifically  make  no  mention  of  what  must 
happen  if  x  —  a,  that  is,  if  |x  —  a\  =0.  This  is  because  in  the  cases  we  are  most  interested  in,  substituting 
a  for  x  doesn’t  make  sense. 

Make  sure  you  are  not  confused  by  the  names  of  important  quantities.  The  generic  definition  talks 
about  /(x),  but  the  function  and  the  variable  might  have  other  names.  The  x  was  the  variable  of  the 
original  function;  when  we  were  trying  to  compute  a  slope  or  a  velocity,  x  was  essentially  a  fixed  quantity, 
telling  us  at  what  point  we  wanted  the  slope.  In  the  velocity  problem,  it  was  literally  a  fixed  quantity,  as  we 
focused  on  the  time  t  —  2.  The  quantity  a  of  the  definition  in  all  the  examples  was  zero:  we  were  always 
interested  in  what  happened  as  Ax  became  very  close  to  zero. 

Armed  with  a  precise  definition,  we  can  now  prove  that  certain  quantities  behave  in  a  particular  way. 
The  bad  news  is  that  even  proofs  for  simple  quantities  can  be  quite  tedious  and  complicated.  The  good 
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news  is  that  we  rarely  need  to  do  such  proofs,  because  most  expressions  act  the  way  you  would  expect, 
and  this  can  be  proved  once  and  for  all. 


Solution.  This  is  not  something  we  “need”  to  prove,  since  it  is  “obviously”  true.  But  if  we  couldn’t  prove 
it  using  our  official  definition  there  would  be  something  very  wrong  with  the  definition. 

As  is  often  the  case  in  mathematical  proofs,  it  helps  to  work  backwards.  We  want  to  end  up  showing 
that  under  certain  circumstances  x  +  4  is  close  to  6;  precisely,  we  want  to  show  that  |x  +  4  —  6|  <  £,  or 
\x  —  2|  <  £.  Under  what  circumstances?  We  want  this  to  be  true  whenever  0  <  |jc  —  2|  <  8.  So  the  question 
becomes:  can  we  choose  a  value  for  8  that  guarantees  that  0  <  \x  —  2\  <8  implies  \x  —  2|  <  £?  Of  course: 
no  matter  what  £  is,  8  —  £  works.  X 

So  it  turns  out  to  be  very  easy  to  prove  something  “obvious,”  which  is  nice.  It  doesn’t  take  long  before 
things  get  trickier,  however. 


Solution.  We  will  want  to  be  able  to  show  that  |jc2  —  4|  <  £  whenever  0<  \x  —  2\  <  8,  by  choosing  8 
carefully.  Is  there  any  connection  between  \x  —  2\  and  \x2  —  4|?  Yes,  and  it’s  not  hard  to  spot,  but  it  is  not 
so  simple  as  the  previous  example.  We  can  write  |x2  —  4|  =  |  (jc+ 2)  (x  —  2)  \ .  Now  when  \x  —  2\  is  small,  part 
of  |  (x  +  2)  (x  —  2)  |  is  small,  namely  (x  —  2) .  What  about  (x  +  2)  ?  If  x  is  close  to  2,  (x  +  2)  certainly  can’t  be 
too  big,  but  we  need  to  somehow  be  precise  about  it.  Let’s  recall  the  “game”  version  of  what  is  going  on 
here.  You  get  to  pick  an  £  and  I  have  to  pick  a  8  that  makes  things  work  out.  Presumably  it  is  the  really  tiny 
values  of  £  I  need  to  worry  about,  but  I  have  to  be  prepared  for  anything,  even  an  apparently  “bad”  move 
like  £  =  1000.  I  expect  that  £  is  going  to  be  small,  and  that  the  corresponding  5  will  be  small,  certainly  less 
than  1.  If  8  <  1  then  |x  — (—  2 1  <  5  when  |x  —  2 1  <  5  (because  if  x  is  within  1  or  2,  then  x  is  between  1  and  3 
and x  + 2  is  between  3  and  5).  So  then  I’d  be  trying  to  show  that  |(x  +  2)(x  — 2)|  <  5|x  — 2|  <  £.  So  now  how 
can  I  pick  8  so  that  |x  — 2|  <5  implies  5|x  — 2|  <  £?  This  is  easy:  use  5  =  £/5,  so  5|x  —  2\  <  5(e/5)  —  £. 
But  what  if  the  £  you  choose  is  not  small?  If  you  choose  £  =  1000,  should  I  pick  8  =  200?  No,  to  keep 
things  “sane”  I  will  never  pick  a  8  bigger  than  1.  Here’s  the  final  “game  strategy”:  when  you  pick  a 
value  for  £,  I  will  pick  8  =  e/5  or  5=1,  whichever  is  smaller.  Now  when  |x  —  2|  <  8, 1  know  both  that 
|x  +  2|  <  5  and  that  |x  — 2|  <  e/5.  Thus  |(x  +  2)(x  —  2)\  <  5 (fi/5)  =  £. 

This  has  been  a  long  discussion,  but  most  of  it  was  explanation  and  scratch  work.  If  this  were  written 
down  as  a  proof,  it  would  be  quite  short,  like  this: 

Proof  that  limx2  =  4.  Given  any  £,  pick  8  =  e/5  or  3=1,  whichever  is  smaller.  Then  when  |x  —  2\  <  8, 

x — >2 

\x  +  2\  <  5  and  |x  — 2|  <  e/5.  Hence  |x2  —  4|  =  |(x  +  2)(x  —  2)\  <  5(£/5)  —  e.  X 

It  probably  seems  obvious  that  limx  =  4,  and  it  is  worth  examining  more  closely  why  it  seems  ob- 

jc— >2 

vious.  If  we  write  x2  =  x  •  x,  and  ask  what  happens  when  x  approaches  2,  we  might  say  something  like, 
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“Well,  the  first  x  approaches  2,  and  the  second  x  approaches  2,  so  the  product  must  approach  2  •  2.”  In  fact 
this  is  pretty  much  right  on  the  money,  except  for  that  word  “must.”  Is  it  really  true  that  if  x  approaches  a 
and  y  approaches  b  then  xy  approaches  abl  It  is,  but  it  is  not  really  obvious,  since  x  and  y  might  be  quite 
complicated.  The  good  news  is  that  we  can  see  that  this  is  true  once  and  for  all,  and  then  we  don’t  have  to 
worry  about  it  ever  again.  When  we  say  that  x  might  be  “complicated”  we  really  mean  that  in  practice  it 
might  be  a  function.  Here  is  then  what  we  want  to  know: 


Proof.  We  must  use  the  Precise  Definition  of  a  Limit  to  prove  the  Produce  Law  for  Limits.  So  given  any  £ 
we  need  to  find  a  8  so  that  0  <  \x  —  a\  <  8  implies  |  f(x)g(x)  —  LM\  <  £.  What  do  we  have  to  work  with? 
We  know  that  we  can  make  /(x)  close  to  L  and  g(x)  close  to  M,  and  we  have  to  somehow  connect  these 
facts  to  make  f(x)g(x)  close  to  LM. 

We  use,  as  is  often  the  case,  a  little  algebraic  trick: 

\f(x)g(x)-LM\  =  \f(x)g(x)  -  f(x)M  +  f(x)M  -  LM\ 

=  \f(x){g(x)  —M)  +  (f(x)  —L)M\ 

<  l/MUM  ~M)\  +  |(/(x)  -L)M | 

=  \f{x)\\g{x)  -  M\  +  \f{x)  -  l\\m\. 


This  is  all  straightforward  except  perhaps  for  the  “<”.  That  is  an  example  of  the  triangle  inequality, 
which  says  that  if  a  and  b  are  any  real  numbers  then  \aJrb\  <  \a\  +  \b\.  If  you  look  at  a  few  examples, 
using  positive  and  negative  numbers  in  various  combinations  for  a  and  b,  you  should  quickly  understand 
why  this  is  true.  We  will  not  prove  it  formally. 

Suppose  £  >  0.  Since  lim/(x)  =  L,  there  is  a  value  <5j  such  that  0  <  |x  —  a\  <  <5i  implies  |/(x)  —  L\  < 

x-^a 


This  means  that  0  <  |x  —  a\  <  5]  implies  |/(x)  —  L\\M\  <  \ f{x)  —  L|(l  +  \M\)  <  e/2. 


2(1  +  \M\y 

Now  we  focus  our  attention  on  the  other  term  in  the  inequality,  |/(x)  |  |g(x)  —  M\ .  We  can  make  |g(x)  — 
M |  smaller  than  any  fixed  number  by  making  x  close  enough  to  a;  unfortunately,  e/(2f(x))  is  not  a  fixed 
number,  since  x  is  a  variable.  Here  we  need  another  little  trick,  just  like  the  one  we  used  in  analyzing 
x1.  We  can  find  a  82  so  that  |x  —  a\  <82  implies  that  |/(x)  —  L\  <  1,  meaning  that  L  —  1  <  f{x)  <  L+  1. 
This  means  that  |/(x)|  <  N,  where  N  is  either  \L—  1|  or  |L+  1|,  depending  on  whether  L  is  negative  or 
positive.  The  important  point  is  that  N  doesn’t  depend  on  x.  Finally,  we  know  that  there  is  a  so  that 
0  <  \x  —  a\  <  83  implies  |g(x)  —M\  <  e/{2N).  Let  8  be  the  smallest  of  <5i,  81,  and  83.  Then  \x  —  a\  <  8 
implies  that  |/(x)  —L\  <e/(2(l  +  |M|)),  \f(x)\  <  N,  and  |g(x)  —  M\  <  e/(2N).  Then 


I  f(x)g(x)  -  LM |  <  |  f{x)  |  |g(x)  -M |  +  I/O)  -  L\  \M\ 

<N/n+  2(1  +  |M|)(1  +  |M|) 

£  £ 

=  2  +  2=C- 


This  is  just  what  we  needed,  so  by  the  official  definition,  lim/(x)g(x)  =  LM. 

x — Ya 


4 
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The  concept  of  a  one-sided  limit  can  also  be  made  precise. 


Exercises  for  Section  3.2 


Exercise  3.2.1  Give  an  £-8  proof  of  the  fact  that  I  i  in  ( 2x  —  5)  =  3. 

jt— >4 

Exercise  3.2.2  Let  £  be  a  small  positive  real  number.  How  close  to  2  must  we  hold  x  in  order  to  be  sure 
that  3x  +  1  lies  within  £  units  of  7? 


3.3  Computing  Limits:  Graphically 


In  this  section  we  look  at  an  example  to  illustrate  the  concept  of  a  limit  graphically. 

The  graph  of  a  function  f{x)  is  shown  below.  We  will  analyze  the  behaviour  of  f{x)  around  x  —  —5, 
v  =  —2,  x  —  —  1  and  x  —  0,  and  x  —  4. 


Observe  that  f(x)  is  indeed  a  function  (it  passes  the  vertical  line  test).  We  now  analyze  the  function  at 
each  point  separately. 
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x  =  —  5  :  Observe  that  at  x  =  —  5  there  is  no  closed  circle,  thus  /(— 5)  is  undefined.  From  the  graph  we 
see  that  as  x  gets  closer  and  closer  to  —5  from  the  left,  then  f(x)  approaches  2,  so 

lim  f(x)  =  2. 

x—t— 5 

Similarly,  as  x  gets  closer  and  closer  —5  from  the  right,  then  f(x')  approaches  —3,  so 

lim  f(x)  —  —3. 

jt-»— 5+ 

As  the  right-hand  limit  and  left-hand  limit  are  not  equal  at  —5,  we  know  that 

lim  f(x)  does  not  exist. 

x— >— 5 

x  =  —2  :  Observe  that  at  x  =  —2  there  is  a  closed  circle  at  0,  thus  /(— 2)  =  0.  From  the  graph  we  see 
that  as  x  gets  closer  and  closer  to  —2  from  the  left,  then  f(x)  approaches  3.5,  so 

lim  f(x)  =  3.5. 

x — y — 2 

Similarly,  as  x  gets  closer  and  closer  —2  from  the  right,  then  f(x')  again  approaches  3.5,  so 

lim  f(x)  =3.5. 

x->-  2+ 

As  the  right-hand  limit  and  left-hand  limit  are  both  equal  to  3.5,  we  know  that 

lim  f(x)  =  3.5. 

x — y — 2 

Do  not  be  concerned  that  the  limit  does  not  equal  0.  This  is  a  discontinuity,  which  is  completely  valid, 
and  will  be  discussed  in  a  later  section. 

We  leave  it  to  the  reader  to  analyze  the  behaviour  of  f(x)  for  x  close  to  —1  and  0. 

Summarizing,  we  have: 


/(— 5)  is  undefined 

/(— 2)  =  0 

/(-!)  =  -  2 

m  =  -2 

lim  f(x)  =  2 

x— ¥— 5 

lim  f{x)  =3.5 

x — y — 2 

lim  fix)  =0 

x — y —  1 

l™  /M  =  -2 

x — ^0 

lim  f(x)  =  —3 

x->- 5+ 

lim  f{x)  —  3.5 

.v  y-2~ 

lim  fix)  =  -2 

x — y — 1 ' 

lim  fix)  —  —2 

A'^0+ 

lim  f(x)  =  DNE 

x— 5 

lim  fix)  =  3.5 

x — y — 2 

lim  fix)  =  DNE 

x — y — 1 

lim  fix)  -  -2 

x — ^0 
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Exercises  for  Section  3.3 


Exercise  3.3.1  Evaluate  the  expressions  by  reference  to  this  graph: 


(a)  lim/(x) 

(e)  lim  /(x) 

(i)  lim/(x+l) 

x—>4 

x— >0+ 

x— >0 

(b)  lim  f{x) 

x— >— 3 

(f)  /(- 2) 

o)  m 

(c)  lim/(x) 

(g)  lim_/(x) 

(k)  lim  /(x-4) 

x — y  1 

(d)  lim /(» 

x — 

(h)  lim  _f(x) 

x — y — 2 

0)  lim  /(x-2) 

3.4  Computing  Limits:  Algebraically 


Properties  of  limits 


We  begin  by  deriving  a  handful  of  theorems  to  give  us  the  tools  to  compute  many  limits  without  explicitly 
working  with  the  precise  definition  of  a  limit. 
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Roughly  speaking,  these  rules  say  that  to  compute  the  limit  of  an  algebraic  expression,  it  is  enough 
to  compute  the  limits  of  the  “innermost  bits”  and  then  combine  these  limits.  This  often  means  that  it  is 
possible  to  simply  plug  in  a  value  for  the  variable,  since  lim.*  =  a. 

x—>a 


Solution.  If  we  apply  the  theorem  in  all  its  gory  detail,  we  get 

lim(x2  —  3x  +  5) 

lim(x  —  2) 

x — y  1 

(lim.*2)  —  (lim  3x)  +  (lim5) 

x — y  1  x — y  1  x — y  1 

(lim.*)  —  (lim  2) 

x — y  1  x — y  1 

(lim.*)2  —  3(limx)  +5 

x — y  1  x — y  1 

(lim.*)  —  2 

x — y  1 

l2  — 3-1  +  5 


x 

It  is  worth  commenting  on  the  trivial  limit  lim  5.  From  one  point  of  view  this  might  seem  meaningless, 

x — y  1 

as  the  number  5  can’t  “approach”  any  value,  since  it  is  simply  a  fixed  number.  However,  5  can,  and 


, .  x2  —  3x  +  5 

lim - - — 

x — y  1  x  2 
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should,  be  interpreted  here  as  the  function  that  has  value  5  everywhere,  /(x)  —  5,  with  graph  a  horizontal 
line.  From  this  point  of  view  it  makes  sense  to  ask  what  happens  to  the  values  of  the  function  (height  of 
the  graph)  as  x  approaches  1 . 

We’re  primarily  interested  in  limits  that  aren’t  so  easy,  namely,  limits  in  which  a  denominator  ap¬ 
proaches  zero.  There  are  a  handful  of  algebraic  tricks  that  work  on  many  of  these  limits. 


Solution.  We  can’t  simply  plug  in  x 

lim 


x — y  1 


1  because  that  makes  the  denominator  zero.  However: 
+  2x  —  3  (x  —  1)  (x  T  3) 

X— 1  x — ^  1  X— 1 


=  lim(x  +  3)  =  4 

x — y  1 


x 

The  technique  used  to  solve  the  previous  example  can  be  referred  to  as  factor  and  cancel.  Its  validity 
comes  from  the  fact  that  we  are  allowed  to  cancel  x  —  1  from  the  numerator  and  denominator.  Remember 
in  Calculus  that  we  have  to  make  sure  we  don’t  cancel  zeros,  so  we  require  x  —  1  0  in  order  to  cancel 

it.  But  looking  back  at  the  definition  of  a  limit  using  x  — >  1,  the  key  point  for  this  example  is  that  we  are 
taking  values  of  x  close  to  1  but  not  equal  to  1.  This  is  exactly  what  we  wanted  (x^  1)  in  order  to  cancel 
this  common  factor. 

While  Theorem  3.8  is  very  helpful,  we  need  a  bit  more  to  work  easily  with  limits.  Since  the  theorem 
applies  when  some  limits  are  already  known,  we  need  to  know  the  behavior  of  some  functions  that  cannot 
themselves  be  constructed  from  the  simple  arithmetic  operations  of  the  theorem,  such  as  y/x.  Also,  there  is 
one  other  extraordinarily  useful  way  to  put  functions  together:  composition.  If  /(x)  and  g(x)  are  functions, 
we  can  form  two  functions  by  composition:  f(g(x ))  and  g(/(x)).  For  example,  if  /(x)  =  y/x  and  g(x)  = 
x2  +  5,  then  f(g(x))  —  \/x2  +  5  and  g(f{x))  —  ( y/x)2  +  5  =  x  +  5.  Here  is  a  companion  to  Theorem  3.8 
for  composition: 


Note  the  special  form  of  the  condition  on  /:  it  is  not  enough  to  know  that  lim  /(x)  =  M,  though  it  is  a 

x — yL 

bit  tricky  to  see  why.  We  have  included  an  example  in  the  exercise  section  to  illustrate  this  tricky  point  for 
those  who  are  interested.  Many  of  the  most  familiar  functions  do  have  this  property,  and  this  theorem  can 
therefore  be  applied.  For  example: 


3.4.  Computing  Limits:  Algebraically  85 


This  theorem  is  not  too  difficult  to  prove  from  the  definition  of  limit. 

Another  of  the  most  common  algebraic  tricks  is  called  rationalization.  Rationalizing  makes  use  of  the 
difference  of  squares  formula  (a  —  b)  (a  4-  b)  —  a 2  —  b2.  Here  is  an  example. 


Solution. 


lim 

x — y — 1 


\/x  4"  5  —  2 

x+  1 


lim 

x — y —  1 


=  lim 


\/x  4~  5  —  2  \/x  -f-  5  4~  2 
x  T  1  \J x  4~  5  4~  2 

X4-5-4 


-t— i  (x  4-  1 )  ( yjx  4-  5  4-  2) 

..  x+ 1 

lim  - ,  - 

-t— i  (x  4-  1 )  ( \/x  4~  5  4-  2) 

r  1  1 

x-+-\  y/x+5  +  2  4 


At  the  very  last  step  we  have  used  Theorems  3.11  and  3.12. 


X 


Solution.  The  function  /(x)  =  x/|x|  is  undefined  at  0;  when  x  >  0,  |x|  =  x  and  so  /(x)  =  1;  when  x  <  0, 
|x|  =  —x  and  /(x)  =  —  1.  Thus 

x 

lim  —  =  lim  —  1  =  —  1 

x— |x|  x— 

while 

x 

lim  —  =  lim  1  =  1. 

x— ^0+  \X\  x— ^0+ 

X 

The  limit  of  /(x)  must  be  equal  to  both  the  left  and  right  limits;  since  they  are  different,  the  limit  lim  — 

x — >-0  |x| 

does  not  exist.  X 
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Exercises  for  3.4 


Exercise  3.4.1  Compute  the  limits.  If  a  limit  does  not  exist,  explain  why. 


(a)  lim 


x2  +x—  12 


x — >3  X  3 


(b)  lim 


x2  +x—  12 


x — y  1  x  3 

x2  +x—  12 


(c)  lim 


x  y — 4  x  3 


(d)  lim 


x2  +x  —  12 


(e)  lim 


x — >2  X  2 

\J  x  8  —  3 


x — y  1  x  1 


(f)  lim  \/-  +  2- 
x^o+  V  x 

(g)  lim  3 

x — >2 


(h)  lim3.r3—  5x 

x->4 


(i)  lim 


4x  —  5x2 


x — >0  X  1 

x2  —  1 


(j)  lim 


x — y  1  x  1 


y/2  —  x2 


(k)  lim 

*->0+  x 


(l)  lim 


V2- 


(m)  lim 


x— >-0+  X  -\-  1 

3  3 

x  —a 


x-^a  x  —  a 

.2  ,  A\ 3 


(n)  lim(^“  +  4) 

x — >2 


Exercise  3.4.2  Let  f(x)  — 


1  ifx  f  0 


and  g(x)  —  0.  What  are  the  values  of  L  —  limx_>o§(-,c)  and 


0  ifx  =  0 

M  —  I i mx-ft  f  (x)  ?  Is  it  true  that  limt_>o /(■?(•*))  =M?  What  are  some  noteworthy  differences  between  this 
example  and  Theorem  3.11? 


3.5  Infinite  Limits  and  Limits  at  Infinity 


We  occasionally  want  to  know  what  happens  to  some  quantity  when  a  variable  gets  very  large  or  “goes  to 
infinity”. 


Example  3.15:  Limit  at  Infinity 


What  happens  to  the  function  cos(l/x)  as  x  goes  to  infinity?  It  seems  clear  that  as  x  gets  larger 
and  larger,  1  / x  gets  closer  and  closer  to  zero,  so  cos(  1  /x)  should  be  getting  closer  and  closer  to 
cos(0)  =  1. 


As  with  ordinary  limits,  this  concept  of  “limit  at  infinity”  can  be  made  precise.  Roughly,  we  want 
lim  fix)  —L  to  mean  that  we  can  make  fix)  as  close  as  we  want  to  L  by  making  x  large  enough. 

x— y°° 
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We  include  this  definition  for  completeness,  but  we  will  not  explore  it  in  detail.  Suffice  it  to  say  that 
such  limits  behave  in  much  the  same  way  that  ordinary  limits  do;  in  particular  there  is  a  direct  analog  of 
Theorem  3.8. 


Solution.  As  x  goes  to  infinity  both  the  numerator  and  denominator  go  to  infinity.  We  divide  the  numerator 
and  denominator  by  x2: 

3  7 

,  ■  2x2  —  3x  +  7  r  r2 

lim  z - - - =  lim - f= — 

x^cc  x2  _|_  47^  _)_  i  x^-oo  47  1 

T  I  I 

x  xz 

Now  as  x  approaches  infinity,  all  the  quotients  with  some  power  of  x  in  the  denominator  approach  zero, 
leaving  2  in  the  numerator  and  1  in  the  denominator,  so  the  limit  again  is  2.  X 

In  the  previous  example,  we  divided  by  die  highest  power  ofx  that  occurs  in  the  denominator  in  order 
to  evaluate  the  limit.  We  illustrate  another  technique  similar  to  this. 


Solution.  As  x  becomes  large,  both  the  numerator  and  denominator  become  large,  so  it  isn’t  clear  what 
happens  to  their  ratio.  The  highest  power  of  x  in  the  denominator  is  x2,  therefore  we  will  divide  every  term 
in  both  the  numerator  and  denominator  by  x2  as  follows: 


2x2  +  3  2  +  3/x2 

lim  — r - =  lim  — — — — . 

5x“  +  x  5  +  1  /x 

Most  of  the  limit  rules  from  last  lecture  also  apply  to  infinite  limits,  so  we  can  write  this  as: 


lim  2  +  3  lim  — 

JC— JC— »°c>  X Z 

lim  5  +  lim  - 

x— x— x 


2  +  3(0)  _  2 
5  +  0  “  5' 


Note  that  we  used  the  theorem  above  to  get  that  lim  -  =  0  and  lim  =  0. 

x— x  x— >°°  xL 
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A  shortcut  technique  is  to  analyze  only  the  leading  terms  of  the  numerator  and  denominator.  A  leading 
term  is  a  term  that  has  the  highest  power  of  x.  If  there  are  multiple  terms  with  the  same  exponent,  you 
must  include  all  of  them. 

Top:  The  leading  term  is  lx2. 

Bottom:  The  leading  term  is  5x2. 

Now  only  looking  at  leading  terms  and  ignoring  the  other  terms  we  get: 


lx2  +  3  2x2 

lim  — = - =  lim  — -y 

5xz  +  x  x^-oo  5xz 


2 

5' 


X 


We  next  look  at  limits  whose  value  is  infinity  (or  minus  infinity). 


This  definition  can  be  modified  for  one-sided  limits  as  well  as  limits  with  x  — >  a  replaced  by  x  — >  °°  or 

X  — >  — oo. 


Solution.  One  might  be  tempted  to  write: 

lim  x  —  lim  x  =  °°  — 

X— X— 

however,  we  do  not  know  what  °°  —  oo  is,  as  °°  is  not  a  real  number  and  so  cannot  be  treated  like  one.  We 
instead  write: 

lim  (x3  —  x)  =  lim  x(x2  —  1). 

X— >°°  X— »°° 

As  x  becomes  arbitrarily  large,  then  both  x  and  x2  —  1  become  arbitrarily  large,  and  hence  their  product 
x(x2  —  1)  will  also  become  arbitrarily  large.  Thus  we  see  that 

lim(x3  —  x)  =  oo. 

x— >°° 


X 
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Often,  the  shorthand  notation  — —  =  +°°  and  —  = 

0+  o 

respectively: 

lim  -  =  +°°  and 

*->0+  x 


— is  used  to  represent  the  following  two  limits 


lim  -  =  —  °o. 

x^Q-  X 


Using  the  above  convention  we  can  compute  the  following  limits. 


Solution.  We  have: 

lim  e *  —  =  e+°°  =  °°. 

x->0+ 

i  j_ 

lim  e*  —  eo-  =  e  °°  —  0. 

Thus,  as  left-hand  limit  ^  right-hand  limit, 

lime*  =  DNE. 

x— 


X 


3.5.1.  Vertical  Asymptotes 


The  line  x 


a  is  called  a  vertical  asymptote  of  f{x)  if  at  least 


lim  f(x)  =  °°  lim  f{x)  =  °° 

x— x— >a~ 


one  of  the  following  is  true: 
lim  f(x)  —  °° 

jc->a+ 


lim  f{x)  = 

x — ya 


lim /W  = 

x— >a 


lim ,f(x)  = 

X — YQ,' 


—  00 


—  00 


—  00 
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Solution.  In  the  definition  of  vertical  asymptotes  we  need  a  certain  limit  to  be  ±°°.  Candidates  would  be 
to  consider  values  not  in  the  domain  of  f(x),  such  as  a  =  4.  As  x  approaches  4  but  is  larger  than  4  then 
x  —  4  is  a  small  positive  number  and  2x  is  close  to  8,  so  the  quotient  2x/(x  —  4)  is  a  large  positive  number. 
Thus  we  see  that 


x->4+  X  —  4 

Thus,  at  least  one  of  the  conditions  in  the  definition  above  is  satisfied.  Therefore  x  —  4  is  a  vertical 
asymptote.  X 

3.5.2.  Horizontal  Asymptotes 


The  line  y  —  L  is  a  horizontal  asymptote  of  f(x)  if  either 


lim  f(x)  —L  or 

x— 


lim  f{x)  —  L. 

x — ^ — 00 


Solution.  We  must  compute  two  infinite  limits.  First, 


lim 

x 

Notice  that  for  x  arbitrarily  large  that  x  >  0,  so  that  |jc|  =  x.  In  particular,  for  x  in  the  interval  (0,°°)  we 
have 

lim  M  =  lim  -  =  1. 

X— X  X— >°°  X 

Second,  we  must  compute 

lim  M. 

x^-°°  X 

Notice  that  for  x  arbitrarily  large  negative  that  x  <  0,  so  that  |x|  =  —x.  In  particular,  for  x  in  the  interval 
(—oo,0)  we  have 

lim  lim  —  =  -l. 

x->— °°  x  x->— °°  X 

Therefore  there  are  two  horizontal  asymptotes,  namely,  y  =  1  and  y  =  —  1 .  X 
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3.5.3.  Slant  Asymptotes 


Some  functions  may  have  slant  (or  oblique )  asymptotes,  which  are  neither  vertical  nor  horizontal.  If 

lim  [/(x)  —  (mx  +  b)]  =  0 

X — ^°° 

then  the  straight  line  y  =  mx  +  b  is  a  slant  asymptote  to  fix) .  Visually,  the  vertical  distance  between  fix) 
and  y  —  mx  +  b  is  decreasing  towards  0  and  the  curves  do  not  intersect  or  cross  at  any  point  as  x  approaches 
infinity.  Similarly  when  x  — >■  —  °o. 


Solution.  Note  that  this  function  has  no  horizontal  asymptotes  since  /(x )  — >  — as  x  — )■  °°  and  / (x)  — >•  °o 
as  x  -+  — 

In  rational  functions,  slant  asymptotes  occur  when  the  degree  in  the  numerator  is  one  greater  than  in 
the  denominator.  We  use  long  division  to  rearrange  the  function: 


— 3x2  +4 
x—  1 


— 3x  —  3  + 


1 

x—  1 


The  part  we’re  interested  in  is  the  resulting  polynomial  — 3x  —  3.  This  is  the  line  y  —  mx  +  b  we  were 
seeking,  where  m  =  —3  and  b  —  —3.  Notice  that 

_  3r2  +  4  1 

lim  — - - (— 3x  —  3)  =  lim - =  0 

x—>°°  x  —  1  x->°°  x  —  1 

and 

-3x2+4  1 

lim  - - - (— 3x  —  3)  =  lim  - -  =  0. 

x— 1 

Thus,  y  —  — 3x  —  3  is  a  slant  asymptote  of  /(x).  X 


Although  rational  functions  are  the  most  common  type  of  function  we  encounter  with  slant  asymptotes, 
there  are  other  types  of  functions  we  can  consider  that  present  an  interesting  challenge. 


Example  3.26:  Slant  Asymptote 


Show  that  y  —  2x  +  4  is  a  slant  asymptote  of  f{x)  —  2x  -  3A'  +  4. 


Solution.  This  is  because 

lim  [/(x)  -  (2x  +  4)]  =  lim  (-3*)  -  0. 

X— ¥— °°  X— >— °° 

X 

We  note  that  limc_>00[/(x)  —  (2x+4)]  =  limr_j.00(— 3*)  =  —  °°.  So,  the  vertical  distance  between  y  =  /(x) 
and  the  line  y  =  2x  + 4  decreases  toward  0  only  when  x  — and  not  when  x  — The  graph  of  / 
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approaches  the  slant  asymptote  y  =  2x  +  4  only  at  the  far  left  and  not  at  the  far  right.  One  might  ask  if 
y  =  fix)  approaches  a  slant  asymptote  when  x  — >  The  answer  turns  out  to  be  no,  but  we  will  need  to 
know  something  about  the  relative  growth  rates  of  the  exponential  functions  and  linear  functions  in  order 
to  prove  this.  Specifically,  one  can  prove  that  when  the  base  is  greater  than  1  the  exponential  functions 
grows  faster  than  any  power  function  as  x  — >  This  can  be  phrased  like  this:  For  any  a  >  1  and  any  n  >  0, 

ax  xn 

lim  —  —  and  lim  —  =0. 

x— »°°  xn  x— ax 

These  facts  are  most  easily  proved  with  the  aim  of  something  called  the  L’Hopital’s  Rule. 

3.5.4.  End  Behaviour  and  Comparative  Growth  Rates 


Let  us  now  look  at  the  last  two  subsections  and  go  deeper.  In  the  last  two  subsections  we  looked  at 
horizontal  and  slant  asymptotes.  Both  are  special  cases  of  the  end  behaviour  of  functions,  and  both  concern 
situations  where  the  graph  of  a  function  approaches  a  straight  line  as  x  — *  °°  or  —  °°.  But  not  all  functions 
have  this  kind  of  end  behviour.  For  example,  /(x)  =  x2  and  fix)  —  x3  do  not  approach  a  straight  line  as 
x  — *  °°  or  —co.  The  best  we  can  say  with  the  notion  of  limit  developed  at  this  stage  are  that 

lim  x  —  °°,  lim  x“  =  °°, 

X— X— >  —  °° 

lim  x3  =  °o,  lim  x3  =  —  °o. 

x— x — y — °° 

Similarly,  we  can  describe  the  end  behaviour  of  transcendental  functions  such  as  /(x)  =  ex  using  limits, 
and  in  this  case,  the  graph  approaches  a  line  as  x  — >  — but  not  as  x  — »  00. 

lim  ex  =  0,  lim  ex  — 

X^—oo  X— >00 

People  have  found  it  useful  to  make  a  finer  distinction  between  these  end  behaviours  all  thus  far 
captured  by  the  symbols  °°  and  —  °°.  Specifically,  we  will  see  that  the  above  functions  have  different 
growth  rates  at  infinity.  Some  increases  to  infinty  faster  than  others.  Specifically, 


Here  are  a  few  obvious  examples: 
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Example  3.28: 


Show  that  if  m  >  n  are  two  positive  integers,  then  fix)  =  xm  grows  faster  than  g(x)  —  x"  as  x  — > 


Solution.  Since  m  >  n,  m  —  n  is  a  positive  integer.  Therefore, 

fim  IfZl  =  iim  _  =  lim  fn~n  =  oo. 
x— yoo  gfx)  x->°°  xn  x — ^°° 


X 


Example  3.29: 


Show  that  if  m  >  n  are  two  positive  integers,  then  any  monic  polynomial  Pm(x)  of  degree  m  grows 
faster  than  any  monic  polynomial  P„(x)  of  degree  n  as  x  — >  [Recall  that  a  polynomial  is  monic  if 
its  leading  coefficient  is  1.] 


Solution.  By  assumption,  Pm(x)  —  xjn+  terms  of  degrees  less  than  m  =  xjn  +  am-\x'n  1  + . . .,  and  Pn(x)  — 

xn+  terms  of  degrees  less  than  n  —  xn  +  bri_\xn  1  + _  Dividing  the  numerator  and  denominator  by  x" , 

we  get 


lim 

x— 


f(-x ) 


lim 

x— >°° 


*n-n  +  am-  ixjn 


-n—  1 


+  ... 


1  +  %i  +  --- 


=  iimyn-"( 

x— ><*> 


1  _i_  ^m—  1  | 

'  X  '  *  *  * 

1  +  ^  +  ... 


oo 

5 


since  the  limit  of  the  bracketed  fraction  is  1  and  the  limit  of  xjn  n  is  °°,  as  we  showed  in  Example  3.28. 

X 


Example  3.30: 


Show  that  a  polynomial  grows  exactly  as  fast  as  its  highest  degree  term  as  x  °°  or  — oo.  That  is,  if 

P(x)  is  any  polynomial  and  Q(x)  is  its  highest  degree  term,  then  both  limits 


lim 

X— 


p  M 

GW 


and  lim 

x — y — 00 


fW 

GW 


are  finite  and  nonzero. 


Solution.  Suppose  that  P(x)  =  anxn  +  an  _  i xn  1  +  . . .  +  a\x  +  ao,  where  an  f  0.  Then  the  highest  degree 
term  is  Q  (x)  =  anxf\  So, 


lim 

X— 


fW 

GW 


lim 

X— >oo 


an- 1  <^1  Qq\ 

x  xn~ 1  xn ) 


Un  0. 


X 


Let’s  state  a  theorem  we  mentioned  when  we  discussed  the  last  example  in  the  last  subsection: 
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The  easiest  way  to  prove  this  is  to  use  the  L’Hopital’s  Rule,  which  we  will  introduce  in  a  later  chapter. 
For  now,  one  can  plot  and  compare  the  graphs  of  an  exponential  function  and  a  power  function.  Here  is  a 
comparison  between  f(x)  —  x2  and  g(x)  —  2X: 


Notice  also  that  as  x  — >  — xn  grows  in  size  but  e*  does  not.  More  specifically,  x"  — >  °°  or  —  °o 
according  as  n  is  even  or  odd,  while  ex  — >  0.  So,  it  is  meaningless  to  compare  their  “growth”  rates, 
although  we  can  still  calculate  the  limit 

e* 

lim  —  =  0. 

x— >—  °°  Xn 

Let’s  see  an  application  of  our  theorem. 


Solution.  To  find  horizontal  asymptotes,  we  calculate  the  limits  of  f(x)  as  x  — >  °°  and  x  — >■  — For  x  — > 
we  divide  the  numerator  and  the  denominator  by  ex,  and  then  we  take  limit  to  get 

x3  +  2eY  ^  +  2  0  +  2 

lim - — =  lim  — - T  = - — — 

x^>°°  ex  —  4x2  x^-oc  |  _  ^x_  i  -  4(0) 


2. 
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For  x  — »  — °o,  we  divide  the  numerator  and 

lim 

X— ¥— °° 

The  denominator  now  approaches  0  —  4  = 


the  denominator  by  x2  to  get 

x3  +  2ex  _  x  + 

xz 

—4.  The  numerator  has  limit  —  °°.  So,  the  quotient  has  limit  °o: 


x  +  2^ 

lim  —7 — —  =  oo. 

x — y — °°  4-4 

XZ 


So,  y  —  2  is  a  horizontal  asymptote.  The  function  y  —  /(x)  approaches  the  line  y  =  2  as  x  — >  And 
this  is  the  only  horizontal  asymptote,  since  the  function  y  —  /(x)  does  not  approach  any  horizontal  line  as 
x  — >■  —00.  X 


Since  the  growth  rate  of  a  polynomial  is  the  same  as  that  of  its  leading  term,  the  following  is  obvious: 


Also,  if  r  is  any  real  number,  then  we  can  place  it  between  two  consecutive  integers  n  and  n+  1 .  For 
example,  \/3  is  between  1  and  2,  e  is  between  2  and  3,  and  tz  is  between  3  and  4.  Then  the  following  is 
totally  within  our  expectation: 


Example  3.34: 


Prove  that  ifa>  1  is  any  basis  and  r  >  0  is  any  exponent,  then  /(x)  =  ax  grows  faster  than  g(x)  —  xr 
as  x  — »  00. 


Solution.  Let  r  be  between  consecutive  integers  n  and  n  +  1.  Then  for  all  x  >  1,  x"  <  xr  <  x"+1.  Dividing 
by  cix,  we  get 

xn  xr  xn+l 

—  <  —  < - . 

ax  ax  ax 


Since 


x» 

lim  —  =  0. 

x^oo  ax 


X 


What  about  exponential  functions  with  different  bases?  We  recall  from  the  graphs  of  the  exponential 
functions  that  for  any  base  a  >  1, 


lim  ax  —  00. 

x — 


So,  the  exponential  functions  with  bases  greater  than  1  all  grow  to  infinity  as  x  — >■  °°.  How  do  their 
growth  rates  compare? 
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Theorem  3.35: 


Ifl<a<b,  then  f  (x)  =  bx  grows  faster  than  g(x)  —  ax  as  x  — *  °°. 


Proof.  Proof.  Since  a  <  b,  we  have  ft  >  1.  So, 


lim  —  =  lim  -  =  °°. 


a  \  a 


* 

Another  function  that  grows  to  infinity  as  x  — >  °°  is  g(x)  —  lnx.  Recall  that  the  natural  logarithmic 
function  is  the  inverse  of  the  exponential  function  y  —  ex.  Since  ex  grows  very  fast  as  x  increases,  we 
should  expect  lnx  to  grow  very  slowly  as  x  increases.  The  same  applies  to  logarithmic  functions  with  any 
basis  a  >  1.  This  is  the  content  of  the  next  theorem. 


Proof. 


1 .  We  use  a  change  of  variable.  Letting  t  —  In x,  then  x  —  eb  So,  x  — >  °o  if  and  only  if  t  — »  °°,  and 

\nx  t  t 

lim  —  =  lim  — —  =  lim  . 

x — y°°  xr  t^oo  (ery  t— \€ry 

Now,  since  r  >  0,  a  =  er  >  1.  So,  a 1  grows  as  t  increases,  and  it  grows  faster  than  t  as  t  — *  °o. 
Therefore, 

ln.r  t  t 

lim  —  =  lim  - — —  =  lim  —  =  0. 

x1  (ery  r^'°°  cv 

lnjc 

2.  The  change  of  base  identity  logax  =  ^ —  implies  that  logax  is  simply  a  constant  multple  of  lnx.  The 
result  now  follows  from  (a). 


Exercises  for  3.5 


Exercise  3.5.1  Compute  the  following  limits. 
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(a)  lim  \J x 2  +  x  —  \fx2  —x 


x— 


(b)  lim 


+ 


x— »°o  ex  —  g  ' 

(1/0-1 


(c)  lim 


/->t+  ?2  —  2t  +  1 

/J4  t  +  5  —  2/t  —  \/t3 

<d>  JiS  3f  +  12—  1/f2 

M  Hm  Vy+I  +  VS^T 

;y— y 


(h)  lim 

t — 


(i)  lim 

? — ^°° 


(j) 


..  x  +  x  1 
lim  - == 

*-»— °°  1  +  v  1  —  x 


(k)  lim 


l+  x  —  1 


(!)  lim 

x-^-0+ 


3  +x-1/2  +x_1 
23-4x~1/2 


(g) 


x  +  x1/2+x1/3 
-t-^00  x2/3  _)_  jfl/4 


(l)  lim 

x— >00 


x^+x^1/2 

X  +  X”1/2 


fmj  lim 


x  +  x 


2x  +  x 


/  1  r  5+X  1 
(n)  lim  , 

.v-j.00  1  +  2x  1 


W  lim  -  4* 

x  ^°°  a/  2x2  +  1 


(p)  lim  (x  +  5) 

X — ^°° 


1 

x  3~  2 


(q)  lim  (x  +  5) 

.v— s>0+ 


1 

x  2 


(r)  lim 

x — >2 


x3  —  6x  —  2 
x3  -  4.1 


Exercise  3.5.2  The  function  fix )  = 

vx2  3“  1 

sketch  off  with  its  horizontal  asymptotes. 


has  two  horizontal  asymptotes.  Find  them  and  give  a  rough 


Exercise  3.5.3  Find  the  vertical  asymptotes  off(x)  — - — . 

X  z 

Exercise  3.5.4  Suppose  that  a  falling  object  reaches  velocity  v(t)  =  50(  1  —  e~f5 )  at  time  t,  where  distance 
is  measured  in  m  and  time  s.  What  is  the  object’s  terminal  velocity,  i.e.  the  value  of  v{t )  as  t  goes  to 
infinity? 


Exercise  3.5.5  Find  the  slant  asymptote  of  /(x) 


x2  3-x3-  6 
x  —  3 


Exercise  3.5.6  Compute  the  following  limits. 

(a)  lim  (2.i3  —  x) 

X-^—oo 

(b)  lim  tan-1  (e*) 

x — y°° 

(c)  lim  tan_1(e*) 

X— >—  00 


(d)  lim 

x— >°° 


e*  +  x4 
x3  3-  5  lnx 


(e) 


..  2*  3- 5(3*) 


<fl 


lim 

X— >—  00 


2* +  5(3*) 
3(2*) -3* 
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(g)  lim  Jx\nx  [Hint:  Let  t  —l  lx] 


3.6  A  Trigonometric  Limit 


In  this  section  we  aim  to  compute  the  limit: 


sinx 
lim - 

x — >-0  X 


We  start  by  analyzing  the  graph  of  y  — 


sinx 

x 


y 


Notice  that  x  =  0  is  not  in  the  domain  of  this  function.  Nevertheless,  we  can  look  at  the  limit  as  x  ap¬ 
proaches  0.  From  the  graph  we  find  that  the  limit  is  1  (there  is  an  open  circle  at  x  =  0  indicating  0  is 
not  in  the  domain).  We  just  convinced  you  this  limit  formula  holds  true  based  on  the  graph,  but  how 
does  one  attempt  to  prove  this  limit  more  formally?  To  do  this  we  need  to  be  quite  clever,  and  to  employ 
some  indirect  reasoning.  The  indirect  reasoning  is  embodied  in  a  theorem,  frequently  called  the  squeeze 
theorem. 


This  theorem  can  be  proved  using  the  official  definition  of  limit.  We  won’t  prove  it  here,  but  point  out 
that  it  is  easy  to  understand  and  believe  graphically.  The  condition  says  that  /(x)  is  trapped  between  g(x) 
below  and  h(x)  above,  and  that  at  x  —  a,  both  g  and  h  approach  the  same  value.  This  means  the  situation 
looks  something  like  Figure  3.1. 

For  example,  imagine  the  blue  curve  is  /(x)  =  x2  sinf/r/x),  the  upper  (red)  and  lower  (green)  curves  are 
h(x)  =  x2  and  g(x)  =  — x2.  Since  the  sine  function  is  always  between  —1  and  1,  — x2  <  x2  sin(7t/x)  <  x2, 
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and  it  is  easy  to  see  that  lim^o  —  x2  =  0  =  lirn^o*2-  It  is  not  so  easy  to  see  directly  (i.e.  algebraically) 
that  limY^o*2sin(7r/.r)  =  0,  because  the  n/x  prevents  us  from  simply  plugging  in  x  =  0.  The  squeeze 
theorem  makes  this  “hard  limit”  as  easy  as  the  trivial  limits  involving  x2. 


To  compute  lim(sinx)/x,  we  will  find  two  simpler  functions  g  and  h  so  that  g(x)  <  (sin*)/*  <  h(x), 

and  so  that  lim^ogM  =  limx_s.o  h{x).  Not  too  surprisingly,  this  will  require  some  trigonometry  and 
geometry.  Referring  to  figure  3.2,  x  is  the  measure  of  the  angle  in  radians.  Since  the  circle  has  radius  1, 
the  coordinates  of  point  A  are  (cosx,  sinx) ,  and  the  area  of  the  small  triangle  is  (cosx  sinx)  / 2.  This  triangle 
is  completely  contained  within  the  circular  wedge-shaped  region  bordered  by  two  lines  and  the  circle  from 
(1,0)  to  point  A.  Comparing  the  areas  of  the  triangle  and  the  wedge  we  see  (cosxsinx)/2  <  x/2,  since 
the  area  of  a  circular  region  with  angle  0  and  radius  r  is  Or2 /2.  With  a  little  algebra  this  turns  into 
(sinx) /x  <  1/ cosx,  giving  us  the  h  we  seek. 


Figure  3.2:  Visualizing  sinx/x. 

To  find  g,  we  note  that  the  circular  wedge  is  completely  contained  inside  the  larger  triangle.  The  height 
of  the  triangle,  from  (1,0)  to  point  B,  is  tanx,  so  comparing  areas  we  get  x/2  <  (tanx)/2  =  sinx/ (2cosx). 
With  a  little  algebra  this  becomes  cosx  <  (sinx)  /x.  So  now  we  have 

sinx  1 

cosx  < - < - . 

X  cosx 

Finally,  the  two  limits  limx_5.()cos*  and  lim^o  1/ cosx  are  easy,  because  cos(0)  =  1.  By  the  squeeze 
theorem,  lim^o^in^A  —  1  as  well. 
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Using  the  above,  we  can  compute  a  similar  limit: 


COSX— 1 
lim - 

x — ^0  X 


This  limit  is  just  as  hard  as  sinx/x,  but  closely  related  to  it,  so  that  we  don’t  have  to  do  a  similar  calculation; 
instead  we  can  do  a  bit  of  tricky  algebra. 

cosx— 1  cosx— 1  cosx+1  cos2x— 1  —  sin2x  sinx  sinx 

x  x  cosx+1  x(cosx+l)  x(cosx+l)  x  cosx+1 

To  compute  the  desired  limit  it  is  sufficient  to  compute  the  limits  of  the  two  final  fractions,  as  x  goes  to  0. 
The  first  of  these  is  the  hard  limit  we’ve  just  done,  namely  1.  The  second  turns  out  to  be  simple,  because 
the  denominator  presents  no  problem: 

sinx  sinO  0 

lim - = - =  -  =  0. 

jc-i0  cosx+1  cos0+l  2 


Thus, 


cosx— 1 
lim - 

x — ^0  X 


=  o. 


Solution.  We  have 


sin  5x  cosx 
lim - 

x — ^0  X 


5sin5xcosx 
lim - 

x — 


since  cos(0)  =  1  and  lim 

x — >-0 

Let’s  do  a  harder  one 


sin5x 

5x 

now. 


1. 


=  lim  5  cosx 

x— >0 


sin5x^ 
5x  ) 


=  5 ■ (1)  •  (1)  =  5 


* 
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Solution  Recall  that  the  tan3(2x)  means  that  tan(2x)  is  being  raised  to  the  third  power. 


tan3(2x 

lim 


=  lim 


(sin(2x)); 


x->o  x2  sin(7x)  x->-0  x2  sin(7x)  cos3  (2x) 


Rewrite  in  terms  of  sin  and  cos 


=  lim 


(2* 


3  (  sin  (2.x:) 
2x 


^°x2(7x)(^M)cos3(2x) 


Make  sine  terms  look  like: 


sin0 

~0~ 


8x3(1)3 


=  lim 


x->o  7x3(1)(13) 


sin/ijc 

Replace  lim- - 1  with  1.  Also,  cos(0)  =  1. 

x->0  nx 


lim  - 

x— >0  7 


Cancel  x3’s. 


8 

7' 

* 


Example  3.40:  Applying  the  Squeeze  Theorem 

Compute  the  following  limit:  lim  x3  cos  1 

x->0+ 

(  1  \ 
KVi) 

Solution.  We  use  the  Squeeze  Theorem  to  evaluate  this  limit.  We  know  that  cos  a  satisfies  —  1  <  cos  a  <  1 
for  any  choice  of  a.  Therefore  we  can  write: 

“1-cos(^)-1 

Since  x  — >  0+  implies  x  >  0,  multiplying  by  x3  gives: 

—x3  <  x3  cos  (  <  x3 . 


lim  (—x3)  <  lim  (  x3cos 

x->0+  x->0+  \ 


But  using  our  rules  we  know  that 


<  lim  x3. 


x->0+ 


lim  (—x3)  =  0, 


lim  x3  =  0 

x->0+ 


and  the  Squeeze  Theorem  says  that  the  only  way  this  can  happen  is  if 


lim  x3cos 

x->0+ 


0. 


X 
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When  solving  problems  using  the  Squeeze  Theorem  it  is  also  helpful  to  have  the  following  theorem. 


Exercises  for  3.6 


Exercise  3.6.1  Compute  the  following  limits. 


(a)  lim 

x — ^0 


sin(5x) 

x 


,b,  limS±nilx) 


jc->0  sin(2x) 


cot  (4c) 

(c)  lim - — — 

csc(3x) 


(d)  lim 

x — >0 


tanx 

x 


(e) 


lim 

X-¥7t/  4 


sinx  — cosx 
cos(2x) 


Exercise  3.6.2  For  all  x  >  0,  4x  —  9  <  /(x)  <  x2  —  4x  +  7.  Find  lim  /(x). 

x->4 

Exercise  3.6.3  For  all  x,  2x  <  g(x)  <  x4  —  x2  +  2.  Find  lim  g(x). 

x— >1 

Exercise  3.6.4  Use  the  Squeeze  Theorem  to  show  that  limx4cos(2/x)  =  0. 

x — 


Exercise  3.6.5  Find  the  value  of  1  i  mx_>00 


3x  +  sinx 
x  +  cosx 


Justify  your  steps  carefully. 


3.7  Continuity 


The  graph  shown  in  Figure  3.3(a)  represents  a  continuous  function.  Geometrically,  this  is  because  there 
are  no  jumps  in  the  graphs.  That  is,  if  you  pick  a  point  on  the  graph  and  approach  it  from  the  left  and  right, 
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the  values  of  the  function  approach  the  value  of  the  function  at  that  point.  For  example,  we  can  see  that 
this  is  not  true  for  function  values  near  x  —  1  on  the  graph  in  Figure  3.3(b)  which  is  not  continuous  at  that 
location. 


Figure  3.3:  (a)  A  continuous  function,  (b)  A  function  with  a  discontinuity  atx  —  1. 


Some  readers  may  prefer  to  think  of  continuity  at  a  point  as  a  three  part  definition.  That  is,  a  function 
f(x)  is  continuous  at  x  —  a  if  the  following  three  conditions  hold: 

(i)  f(a )  is  defined  (that  is,  a  belongs  to  the  domain  of  /), 

(ii)  lim/(x)  exists  (that  is,  left-hand  limit  =  right-hand  limit), 

x — Ya 

(iii)  lim/(x)  =  f(a)  (that  is,  the  numbers  from  (i)  and  (ii)  are  equal). 

x — \a 

The  figures  below  show  graphical  examples  of  functions  where  either  (i),  (ii)  or  (iii)  can  fail  to  hold. 
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continuous  at  x  =  a 

( i^a^)  =  f^) 


(i)  fails  to  hold 


lim  f(x)  does  not  exist 

x— «►  a 

(ii)  fails  to  hold 


(iii)  fails  to  hold 


On  the  other  hand,  if  /  is  defined  on  an  open  interval  containing  a,  except  perhaps  at  a,  we  can  say 
that  /  is  discontinuous  at  a  if  /  is  not  continuous  at  a. 

Graphically,  you  can  think  of  continuity  as  being  able  to  draw  your  function  without  having  to  lift  your 
pencil  off  the  paper.  If  your  pencil  has  to  jump  off  the  page  to  continue  drawing  the  function,  then  the 
function  is  not  continuous  at  that  point.  This  is  illustrated  in  Figure  3.3(b)  where  if  we  tried  to  draw  the 
function  (from  left  to  right)  we  need  to  lift  our  pencil  off  the  page  once  we  reach  the  point  x  —  1  in  order 
to  be  able  to  continue  drawing  the  function. 
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Definition  3.43:  Continuity  on  an  Open  Interval 


A  function  f  is  continuous  on  an  open  interval  ( a,b )  if  it  is  continuous  at  every  point  in  the 
interval. 

Furthermore,  a  function  is  everywhere  continuous  if  it  is  continuous  on  the  entire  real  number  line 

( — °°, °°) 


Recall  the  function  graphed  in  a  previous  section  as  shown  in  Figure  3.4. 


We  can  draw  this  function  without  lifting  our  pencil  except  at  the  points  x  —  —5,  x  —  —2,  x  =  —  1,  and 
x  —  4.  Thus,  f(x)  is  continuous  at  every  real  number  except  at  these  four  numbers.  At  x  —  —5,  x  —  —2, 
x——l,  and  x  —  4,  the  function  f(x)  is  discontinuous. 

At  x  —  —2  we  have  a  removable  discontinuity  because  we  could  remove  this  discontinuity  simply  by 
redefining  /(— 2)  to  be  3.5.  At  x  —  —5  and  x  —  —  1  we  have  jump  discontinuities  because  the  function 
jumps  from  one  value  to  another.  From  the  right  of  x  —  4,  we  have  an  infinite  discontinuity  because  the 
function  goes  off  to  infinity. 

Formally,  we  say  f(x)  has  a  removable  discontinuity  at  x  =  a  if  I  i  mA  fix)  exists  but  is  not  equal  to 
f{a).  Note  that  we  do  not  require  f(a)  to  be  defined  in  this  case,  that  is,  a  need  not  belong  to  the  domain 
of  f{x). 
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lim  f(x)  #  lim  f(x) 
x^a+  x— -a- 

JUMP  DISCONTINUITY 


Either  lim  f(x)  =  ±  co  or  lim  f(x)  =  ±  co 

x^a+  x^a~ 

INFINITE  DISCONTINUITY 


Solution.  In  order  to  be  continuous  at  2  we  require 


=/(2) 

x— >2 


to  hold.  We  use  the  three  part  definition  listed  previously  to  check  this. 

1.  First,  /( 2)  =  c,  and  c  is  some  real  number.  Thus,  /( 2)  is  defined. 

2.  Now,  we  must  evaluate  the  limit.  Rather  than  computing  both  one-sided  limits,  we  just  compute  the 

2_  _o 

limit  directly.  For  x  close  to  2  (but  not  equal  to  2)  we  can  replace  /(x)  with  A  to  get: 


lim  fix)  —  lim 

x^2  x—>2 


x2  —  x  —  2 
x  —  2 


]im(x-2)(x±l) 
x — >2  X  2 


=  lim(x-|-  1) 

.r— 


3. 


Therefore  the  limit  exists  and  equals  3. 

3.  Finally,  for  /  to  be  continuous  at  2,  we  need  that  the  numbers  in  the  first  two  items  to  be  equal. 
Therefore,  we  require  c  —  3.  Thus,  when  c  —  3,  /(x)  is  continuous  at  2,  for  any  other  value  of  c,  /(x)  is 
discontinuous  at  2.  X 

For  continuity  on  a  closed  interval,  we  consider  the  one-sided  limits  of  a  function.  Recall  that  x  — >  a~ 
means  x  approaches  a  from  values  less  than  a. 
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If  a  function  /  is  continuous  at  a ,  then  it  is  both  left  and  right  continuous  at  a. 

The  above  definition  regarding  left  (or  right)  continuous  functions  is  illustrated  with  the  following 
figure: 


One-sided  limits  allows  us  to  extend  the  definition  of  continuity  to  closed  intervals.  The  following 
definition  means  a  function  is  continuous  on  a  closed  interval  if  it  is  continuous  in  the  interior  of  the 
interval  and  possesses  the  appropriate  one-sided  continuity  at  the  endpoints  of  the  interval. 


This  definition  can  be  extended  to  continuity  on  half-open  intervals  such  as  (a,b\  and  [, a,b ),  and  un¬ 
bounded  intervals. 
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Example  3.47:  Continuity  on  Other  Intervals 


The  function  f(x)  —  sjx  is  continuous  on  the  (closed)  interval  [0,°°). 

The  function  f(x)  —  y/A  —  x  is  continuous  on  the  (closed)  interval  (— °°,4]. 


The  continuity  of  functions  is  preserved  under  the  operations  of  addition,  subtraction,  multiplication 
and  division  (in  the  case  that  the  function  in  the  denominator  is  nonzero). 


Below  we  list  some  common  functions  that  are  known  to  be  continuous  on  every  interval  inside  their 
domains. 


Example  3.49:  Common  Types  of  Continuous  Functions 


•  Polynomials  (for  all  x),  c.g.,  y  —  mx  +  b,y  —  ax2  +  bx  +  c. 

•  Rational  functions  (except  at  points  x  which  gives  division  by  zero). 

•  Root  functions  f/x  (for  all  x  if  n  is  odd,  and  for  x  >  0  if  n  is  even). 

•  Trigonometric  functions 

•  Inverse  trigonometric  functions 

•  Exponential  funtions 

•  Logarithmic  functions 

For  rational  functions  with  removable  discontinuities  as  a  result  of  a  zero,  we  can  define  a  new  function 
Filing  in  these  gaps  to  create  a  piecewise  function  that  is  continuous  everywhere. 

Continuous  functions  are  where  the  direct  substitution  property  hold.  This  fact  can  often  be  used  to 
compute  the  limit  of  a  continuous  function. 


Solution.  We  will  use  a  continuity  argument  to  justify  that  direct  substitution  can  be  applied.  By  the  list 
above,  yfx,  sinx,  1,  x  and  cosx  are  all  continuous  functions  at  K.  Then  y/x  +  sinx  and  1  +  jc  +  cosx  are  both 
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continuous  at  n.  Finally, 


i/v  +  sinx 

1  +  X  +  COSX 

is  a  continuous  function  at  k  since  1  +  %  +  cos  n  ^  0.  Hence,  we 


can  directly  substitute  to  get  the  limit: 


V^+Sin*  JK  +  smit 

lim - = - 

x-+K  1+x  +  cosx  l  +  ;r  +  cos7r 


\fn  _  1 

71  y/n' 


X 


Continuity  is  also  preserved  under  the  composition  of  functions. 


Theorem  3.51:  Continuity  of  Function  Composition 


If  g  is  continuous  at  a  and  f  is  continuous  at  g(a),  then  the  composition  function  f  o  g  is  continuous 
at  a. 


Solution. 


(a)  The  functions  that  make  up  the  composition  h(x)  —  f(g(x))  are  g(x)  —  x2  and  f(x)  —  cos(x).  The 
function  g  is  continuous  on  R  since  it  is  a  polynomial,  and  /  is  also  continuous  everywhere.  Therefore, 
h(x)  —  (fog)(x)  is  continuous  on  R  by  Theorem  3.51. 

(b)  We  know  from  Example  3.49  that  f(x)  —  lnx  is  continuous  and  g(x)  =  1  +  sinx  are  continuous.  Thus 
by  Theorem  3.51,  H(x)  —  f(g(x))  is  continuous  wherever  it  is  defined.  Now  ln(l  +sinx)  is  defined 
when  1  +  sinx  >  0.  Recall  that  —  1  <  sinx  <  1 ,  so  1  +  sinx  >  0  except  when  sinx  =  —  1 ,  which  happens 

when x  =  ±3n/2, ±7n/2, Therefore,  H  has  discontinuities  when x  =  3nn/2,  n  —  1,2,3,...  and  is 

continuous  on  the  intervals  between  these  values. 


X 


Intermediate  Value  Theorem 


Whether  or  not  an  equation  has  a  solution  is  an  important  question  in  mathematics.  Consider  the  following 
two  questions: 
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Solution. 


1.  The  first  question  is  easy  to  answer  since  for  any  exponential  function  we  know  that  ax  >  0,  and 
we  also  know  that  whenever  you  square  a  number  you  get  a  nonnegative  answer:  x2  >  0.  Hence, 
e*  +  x2  >  0,  and  thus,  is  never  equal  to  zero.  Therefore,  the  first  equation  has  no  solution. 


2.  For  the  second  question,  it  is  difficult  to  see  if  e*  +  x  —  0  has  a  solution.  If  we  tried  to  solve  for  x, 
we  would  run  into  problems.  Let’s  make  a  table  of  values  to  see  what  kind  of  values  we  get  (recall 
that  e  ~  2.7183): 


x 

ex  +  x 

-2 

e~2  —  2  &  — 1.9 

-1 

e-!-lfti-0.6 

0 

e°  +  0  =  1 

1 

e+  1  «  3.7 

Sketching  this  gives: 


y 

▼  1  1  1 

p 

OJ 

• 

I  1  1 

X 

(-1, -0.6)  - 

• 

(-2, -1.9) 

Let  f{x)  —  ex  +  x.  Notice  that  if  we  choose  a  =  —  1  and  b  =  0  then  we  have  f(a )  <  0  and  f(b)  >  0. 
A  point  where  the  function  f(x)  crosses  the  x-axis  gives  a  solution  to  ex  +  x  —  0.  Since  f(x)  =  ex+x 
is  continuous  (both  ex  and  x  are  continuous),  then  the  function  must  cross  the  x-axis  somewhere 
between  —  1  and  0: 


Therefore,  our  equation  has  a  solution. 
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Note  that  by  looking  at  smaller  and  smaller  intervals  (a,b)  with  f(a )  <  0  and  f(b)  >  0,  we  can 
get  a  better  and  better  approximation  for  a  solution  to  e*  +  x  —  0.  For  example,  taking  the  interval 
(—0.4,  —0.6)  gives  /(— 0.4)  <  0  and  /(— 0.6)  >  0,  thus,  there  is  a  solution  to  /(x)  —  0  between  —0.4 
and  —0.6.  It  turns  out  that  the  solution  to^+x^Oisxfs  —0.56714. 


X 

We  now  generalize  the  argument  used  in  the  previous  example.  In  that  example  we  had  a  continuous 
function  that  went  from  negative  to  positive  and  hence,  had  to  cross  the  x-axis  at  some  point.  In  fact,  we 
don’t  need  to  use  the  x-axis,  any  line  y  =  N  will  work  so  long  as  the  function  is  continuous  and  below  the 
line  y  —  N  at  some  point  and  above  the  line  y  —  N  at  another  point.  This  is  known  as  the  Intermediate 
Values  Theorem  and  it  is  formally  stated  as  follows: 


The  Intermediate  Value  Theorem  guarantees  that  if  /(x)  is  continuous  and  f(a)  <  N  <  /(/?),  the  line 
y  —  N  intersects  the  function  at  some  point  x  —  c.  Such  a  number  c  is  between  a  and  b  and  has  the  property 
that  /(c)  =  N  (see  Figure  3.5(a)).  We  can  also  think  of  the  theorem  as  saying  if  we  draw  the  line  y  —  N 
between  the  lines  y  —  f(a )  and  y  =  f(b),  then  the  function  cannot  jump  over  the  line  y  =  N.  On  the  other 
hand,  if  /(x)  is  not  continuous,  then  the  theorem  may  not  hold.  See  Figure  3.5(b)  where  there  is  no  number 
c  in  (a,b)  such  that  /(c)  =  N.  Finally,  we  remark  that  there  may  be  multiple  choices  for  c  (i.e.,  lots  of 
numbers  between  a  and  b  with  y-coordinate  N).  See  Figure  3.5(c)  for  such  an  example. 


Figure  3.5:  (a)  A  continuous  function  where  IVT  holds  for  a  single  value  c.  (b)  A  discontinuous 
function  where  IVT  fails  to  hold,  (c)  A  continuous  function  where  IVT  holds  for  multiple 
values  in  (a,b). 


The  Intermediate  Value  Theorem  is  most  frequently  used  for  N  —  0. 


Solution.  Let  /(x)  =  ^/x+x—  1,  N  —  0,  a  —  0,  and  b  —  8.  Since  yfx,  x  and  —1  are  continuous  on  R,  and  the 
sum  of  continuous  functions  is  again  continuous,  we  have  that  /(x)  is  continuous  on  R,  thus  in  particular, 
/(x)  is  continuous  on  [0, 8] .  We  have  /(a)  =  /( 0)  =  s/0  +  0  —  1  =  —  1  and  f{b)  —  f( 8)  =  s/8  +  8  —  1=9. 
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Thus  N  —  0  lies  between  f(a)  =  —  1  and  f(b)  =  9,  so  the  conditions  of  the  Intermediate  Value  Theorem  are 
satisfied.  So,  there  exists  a  number  c  in  (0, 8)  such  that  /(c)  =  0.  This  means  that  c  satisfies  c  +  c —  1=0, 
in  otherwords,  is  a  solution  for  the  equation  given. 

Alternatively  we  can  let  f(x)  —  ^x+x,  N— l, a  — 0  and  b  —  8.  Then  as  before  f(x)  is  the  sum  of  two 
continuous  functions,  so  is  also  continuous  everywhere,  in  particular,  continuous  on  the  interval  [0, 8] .  We 
have  f(a)  —  /( 0)  =  v^O  +  0  =  0  and  f{b)  —  /( 8)  =  -^8  +  8  =  10.  Thus  N  —  1  lies  between  /(a)  =  0  and 
f(b)  —  10,  so  the  conditions  of  the  Intermediate  Value  Theorem  are  satisfied.  So,  there  exists  a  number 
c  in  (0,8)  such  that  /(c)  =  1.  This  means  that  c  satisfies  y/ c  +  c  =  1,  in  otherwords,  is  a  solution  for  the 
equation  given.  X 


Solution.  By  theorem  3.8,  /  is  continuous.  Since  /( 0)  =  —2  and  /( 1)  =  3,  and  0  is  between  —2  and  3, 
there  is  a  c  G  (0, 1)  such  that  /(c)  =0.  X 

This  example  also  points  the  way  to  a  simple  method  for  approximating  roots. 


Example  3.57:  Approximating  Roots 


Approximate  the  root  of  the  previous  example  to  one  decimal  place. 


Solution.  If  we  compute  /( 0.1),  /( 0.2),  and  so  on,  we  find  that  /( 0.6)  <  0  and  /(0.7)  >  0,  so  by  the 
Intermediate  Value  Theorem,  /  has  a  root  between  0.6  and  0.7.  Repeating  the  process  with  /(0.61), 
/(0.62),  and  so  on,  we  find  that  /(0.61)  <  0  and  /( 0.62)  >  0,  so  /  has  a  root  between  0.61  and  0.62,  and 
the  root  is  0.6  rounded  to  one  decimal  place.  4k 


Exercises  for  3.7 


Exercise  3.7.1  Consider  the  function 


h{x) 


2x  —  3,  if  x  <  1, 
0,  ifx  >  1. 


Show  that  it  is  continuous  at  the  point  x  =  0.  Is  h  a  continuous  function? 


Exercise  3.7.2  Find  the  values  of  a  that  make  the  function  f(x)  continuous  for  all  real  numbers. 


f  4x  +  5,  ifx>—  2, 
\  x2  +  a,  ifx  <  —2. 
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Exercise  3.7.3  Find  the  values  of  the  constant  c  so  that  the  function  g(x)  is  continuous  on  (— < °°, 


$(*)  = 


2  —  2  c2x,  if  x  <  —  1, 
6  — lex2,  ifx>—  1. 


Exercise  3.7.4 
Exercise  3.7.5 
Exercise  3.7.6 


a  9 

Approximate  a  root  of  f  —  x  —  4x  +  2x  +  2  to  one  decimal  place. 
Approximate  a  root  of  f  —  x4  +  .x:3  —  5x+  1  to  one  decimal  place. 

Show  that  the  equation  Ifx  +  x  =  1  has  a  solution  in  the  interx’al  (0,8). 


°°),  where 


4.  Derivatives 


4.1  The  Rate  of  Change  of  a  Function 


Suppose  that  y  is  a  function  of  x,  say  y  —  f(x) .  It  is  often  useful  to  know  how  sensitive  the  value  of  y  is  to 
small  changes  in  x. 


Example  4.1:  Small  Changes  in  x 


Consider  y  —  /(x)  =  \f 625  —  x2  (the  upper  semicircle  of  radius  25  centered  at  the  origin),  and  let’s 
compute  the  changes  ofy  resulting  from  small  changes  of  x  around  x  =  7. 


Solution.  When  x  =  7,  we  find  that  y  —  \/ 625  —  49  =  24.  Suppose  we  want  to  know  how  much  y  changes 
when  x  increases  a  little,  say  to  7.1  or  7.01. 

In  the  case  of  a  straight  line  y  —  mx  +  b,  the  slope  m  —  Ay/ Ax-  measures  the  change  in  y  per  unit  change 
in  x.  This  can  be  interpreted  as  a  measure  of  “sensitivity”;  for  example,  if  y  =  lOOx  +  5,  a  small  change  in 
x  corresponds  to  a  change  one  hundred  times  as  large  in  y,  so  y  is  quite  sensitive  to  changes  in  x. 

Let  us  look  at  the  same  ratio  Ay/Ax  for  our  function  y  —  f(x)  —  \J 625  —  x2  when  x  changes  from  7  to 
7.1.  Here  Ax  =  7.1  —  7  =  0.1  is  the  change  in  x,  and 

Ay  =  /(x  +  Ax)— /(x)  =  /(7.1)  — /(7) 

=  a/ 625  —  7. 12  —  a/ 625  —  72 
^23.9706-24  =  -0.0294. 


Thus,  Ay/Ax  «  —0.0294/0.1  =  —0.294.  This  means  that  y  changes  by  less  than  one  third  the  change  in  x, 
so  apparently  y  is  not  very  sensitive  to  changes  in  x  at  x  =  7.  We  say  “apparently”  here  because  we  don’t 
really  know  what  happens  between  7  and  7.1.  Perhaps  y  changes  dramatically  as  x  runs  through  the  values 
from  7  to  7.1,  but  at  7.1  y  just  happens  to  be  close  to  its  value  at  7.  This  is  not  in  fact  the  case  for  this 
particular  function,  but  we  don’t  yet  know  why.  X 

The  quantity  Ay/Ax  ~  —0.294  may  be  interpreted  as  the  slope  of  the  line  through  (7,24)  and  (7.1,23.9706), 
called  a  chord  of  the  circle.  In  general,  if  we  draw  the  chord  from  the  point  (7,24)  to  a  nearby  point  on 
the  semicircle  (7  +  Ax,  /(7  + Ax)),  the  slope  of  this  chord  is  the  so-called  difference  quotient 

/(7  +  Ax)-/(7)  _  \J 625  —  (7  +  Ay)2  —  24 
Ax  Ax 
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For  example,  if  x  changes  only  from  7  to  7.01,  then  the  difference  quotient  (slope  of  the  chord)  is  ap¬ 
proximately  equal  to  (23.997081  —  24)/0.01  =  —0.2919.  This  is  slightly  different  than  for  the  chord  from 
(7,24)  to  (7.1,23.9706). 

As  Ax  is  made  smaller  (closer  to  0),  7  +  Ax  gets  closer  to  7  and  the  chord  joining  (7,/(7))  to  (7  + 
Ax,/(7  +  Ax))  shifts  slightly,  as  shown  in  Figure  4.1.  The  chord  gets  closer  and  closer  to  the  tangent  line 
to  the  circle  at  the  point  (7,24).  (The  tangent  line  is  the  line  that  just  grazes  the  circle  at  that  point,  i.e.,  it 
doesn’t  meet  the  circle  at  any  second  point.)  Thus,  as  Ax  gets  smaller  and  smaller,  the  slope  Ay /Ax  of  the 
chord  gets  closer  and  closer  to  the  slope  of  the  tangent  line.  This  is  actually  quite  difficult  to  see  when  Ax¬ 
is  small,  because  of  the  scale  of  the  graph.  The  values  of  Ax  used  for  the  figure  are  1,  5,  10  and  15,  not 
really  very  small  values.  The  tangent  line  is  the  one  that  is  uppermost  at  the  right  hand  endpoint. 


Figure  4.1:  Chords  approximating  the  tangent  line. 


So  far  we  have  found  the  slopes  of  two  chords  that  should  be  close  to  the  slope  of  the  tangent  line,  but 
what  is  the  slope  of  the  tangent  line  exactly?  Since  the  tangent  line  touches  the  circle  at  just  one  point,  we 
will  never  be  able  to  calculate  its  slope  directly,  using  two  “known”  points  on  the  line.  What  we  need  is  a 
way  to  capture  what  happens  to  the  slopes  of  the  chords  as  they  get  “closer  and  closer”  to  the  tangent  line. 

Instead  of  looking  at  more  particular  values  of  Ax,  let’s  see  what  happens  if  we  do  some  algebra  with  the 
difference  quotient  using  just  Ax.  The  slope  of  a  chord  from  (7,24)  to  a  nearby  point  (7  +  Ax,/(7  +  Ax)) 
is  given  by 


/(7  +  Ax)-/(7) 
Ax 


yj 625  —  (7  +  Ax)2  —  24 
Ax 

( yj 625  —  (7  +  Ax)2  —  24\  f  ^625  -  (7  +  Ax)2 +  24\ 
V  Ax  J  1^625  — (7  + Ax) 2 +  24  J 

625 -(7  + Ax)2 -242 
Ax(  yj 625  —  (7  +  Ax)2  +  24) 

49  —  49  —  14Ax  —  Ax2 
Ax(  yj 625  —  (7  +  Ax)2  +  24) 

Ax(— 14  — Ax) 

Ax(  yj 625  —  (7  +  Ax)2  +  24) 
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-14-Ax 

~~  \J 625  —  (7  +  At)2  +  24 

Now,  can  we  tell  by  looking  at  this  last  formula  what  happens  when  At  gets  very  close  to  zero?  The 
numerator  clearly  gets  very  close  to  —14  while  the  denominator  gets  very  close  to  \J 625  —  72  +  24  =  48. 
The  fraction  is  therefore  very  close  to  —14/48  =  —7/24  =  —0.29167.  In  fact,  the  slope  of  the  tangent  line 
is  exactly  —7/24. 

What  about  the  slope  of  the  tangent  line  at  x  —  12?  Well,  12  can’t  be  all  that  different  from  7;  we  just 
have  to  redo  the  calculation  with  12  instead  of  7.  This  won’t  be  hard,  but  it  will  be  a  bit  tedious.  What  if 
we  try  to  do  all  the  algebra  without  using  a  specific  value  for  x?  Let’s  copy  from  above,  replacing  7  by  x. 

f(x  4-  At)  —  f(x)  \J 625  —  (x  + At)2  —  \J 625  —  x2 

At  At 

\J 625  —  (x  +  At)2  —  f 625  —  x2  a/ 625  —  (x  +  At)2  +  \J 625  —  x2 
Ax  a/ 625  —  (x  +  At)2  +  \/ 625  —  x2 

625  —  (x  +  At)2  —  625  +x2 
Ax(  ^/625  —  (x  + At)2  +  V 625  —  x2) 

625  —  x2  —  2xAx  —  At2  —  625  +  x2 
Ax(  \/625  —  (x  + At)2  +  V 625  —  x2) 

Ax( — 2x  —  At) 

At(  \/625  —  (x  + At)2  +  t/625  —  x2) 

— 2x  —  At 

a/ 625  —  (x  +  At)2  +  t/ 625  —  x2 


Now  what  happens  when  At  is  very  close  to  zero?  Again  it  seems  apparent  that  the  quotient  will  be 
very  close  to 

— 2x  — 2x  —x 


V 625  —  x2  +  t/ 625  —  x2  2^625 -x2  V625-. 


Replacing  x  by  7  gives  —7/24,  as  before,  and  now  we  can  easily  do  the  computation  for  12  or  any  other 
value  of  x  between  —25  and  25. 

So  now  we  have  a  single  expression,  — x/  \J 625  —  x2,  that  tells  us  the  slope  of  the  tangent  line  for  any 
value  of  x.  This  slope,  in  turn,  tells  us  how  sensitive  the  value  of  y  is  to  small  changes  in  the  value  of  x. 

The  expression  — x/  \J 625  —  x2  defines  a  new  function  called  the  derivative  of  the  original  function 
(since  it  is  derived  from  the  original  function).  If  the  original  is  referred  to  as  /  or  y  then  the  derivative 
is  often  written  f  or  y'  (pronounced  “f  prime”  or  “y  prime”).  So  in  this  case  we  might  write  f\x)  — 
— x/ \/ 625  —  x2  or  y'  —  —xj \J 625  —  x2.  At  a  particular  point,  say  x  =  7,  we  write  /'(7)  =  — 7/24  and  we 
say  that  “/  prime  of  7  is  —7/ 24”  or  “the  derivative  of  /  at  7  is  —7/ 24.” 

To  summarize,  we  compute  the  derivative  of  /(x)  by  forming  the  difference  quotient 


/(x  +  At)  -fjx) 
At 


which  is  the  slope  of  a  line,  then  we  figure  out  what  happens  when  At  gets  very  close  to  0. 


118  Derivatives 


At  this  point,  we  should  note  that  the  idea  of  letting  Ax  get  closer  and  closer  to  0  is  precisely  the  idea 
of  a  limit  that  we  discussed  in  the  last  chapter.  The  limit  here  is  a  limit  as  Ax  approaches  0.  Using  limit 
notation,  we  can  write  f'(x)  —  lim^^o  . 

In  the  particular  case  of  a  circle,  there’s  a  simple  way  to  find  the  derivative.  Since  the  tangent  to  a  circle 
at  a  point  is  perpendicular  to  the  radius  drawn  to  the  point  of  contact,  its  slope  is  the  negative  reciprocal 
of  the  slope  of  the  radius.  The  radius  joining  (0,0)  to  (7,24)  has  slope  24/7.  Hence,  the  tangent  line  has 
slope  —7/24.  In  general,  a  radius  to  the  point  (x,  \f 625  —  x2)  has  slope  \J 625  —  x2/x,  so  the  slope  of  the 
tangent  line  is  — x/  \J 625  —  x2,  as  before.  It  is  NOT  always  true  that  a  tangent  line  is  perpendicular  to  a 
line  from  the  origin — don’t  use  this  shortcut  in  any  other  circumstance. 

As  above,  and  as  you  might  expect,  for  different  values  of  x  we  generally  get  different  values  of  the 
derivative  fix).  Could  it  be  that  the  derivative  always  has  the  same  value?  This  would  mean  that  the  slope 
of  /,  or  the  slope  of  its  tangent  line,  is  the  same  everywhere.  One  curve  that  always  has  the  same  slope  is 
a  line;  it  seems  odd  to  talk  about  the  tangent  line  to  a  line,  but  if  it  makes  sense  at  all  the  tangent  line  must 
be  the  line  itself.  It  is  not  hard  to  see  that  the  derivative  of  /(x)  =  mx  +  b  is  f (x)  =  in. 

Velocity 


We  started  this  section  by  saying,  “It  is  often  useful  to  know  how  sensitive  the  value  of  y  is  to  small  changes 
in  x.”  We  have  seen  one  purely  mathematical  example  of  this,  involving  the  function  /(x)  =  \J 625  —  x2. 
Here  is  a  more  applied  example. 

With  careful  measurement  it  might  be  possible  to  discover  that  the  height  of  a  dropped  ball  t  seconds 
after  it  is  released  is  h(t)  =  ho  —  kt2.  (Here  ho  is  the  initial  height  of  the  ball,  when  t  —  0,  and  k  is  some 
number  determined  by  the  experiment.)  A  natural  question  is  then,  “How  fast  is  the  ball  going  at  time 
f  ?”  We  can  certainly  get  a  pretty  good  idea  with  a  little  simple  arithmetic.  To  make  the  calculation  more 
concrete,  let’s  use  units  of  meters  and  seconds  and  say  that  Jiq  —  100  meters  and  k  =  4.9.  Suppose  we’re 
interested  in  the  speed  at  t  —  2.  We  know  that  when  t  —  2  the  height  is  100  —  4  •  4.9  =  80.4  meters.  A 
second  later,  at  t  —  3,  the  height  is  100  —  9  ■  4.9  =  55.9  meters.  The  change  in  height  during  that  second  is 
55.9  —  80.4  =  —24.5  meters.  The  negative  sign  means  the  height  has  decreased,  as  we  expect  for  a  falling 
ball,  and  the  number  24.5  is  the  average  speed  of  the  ball  during  the  time  interval,  in  meters  per  second. 

We  might  guess  that  24.5  meters  per  second  is  not  a  terrible  estimate  of  the  speed  at  t  —  2,  but  certainly 
we  can  do  better.  At  t  =  2.5  the  height  is  100  —  4.9(2.5)2  =  69.375  meters.  During  the  half  second  from 
t  =  2  to  t  =  2.5,  the  change  in  height  is  69.375  —  80.4  =  —11.025  meters  giving  an  average  speed  of 
11.025/(1/2)  =  22.05  meters  per  second.  This  should  be  a  better  estimate  of  the  speed  at  t  =  2.  So  it’s 
clear  now  how  to  get  better  and  better  approximations:  compute  average  speeds  over  shorter  and  shorter 
time  intervals.  Between  t  —  2  and  t  —  2.01,  for  example,  the  ball  drops  0.19649  meters  in  one  hundredth 
of  a  second,  at  an  average  speed  of  19.649  meters  per  second. 

We  still  might  reasonably  ask  for  the  precise  speed  at  t  =  2  (the  instantaneous  speed)  rather  than  just 
an  approximation  to  it.  For  this,  once  again,  we  need  a  limit.  Let’s  calculate  the  average  speed  during  the 
time  interval  from  t  — 2  to  t  —  2-\- At  without  specifying  a  particular  value  for  At.  The  change  in  height 
during  the  time  interval  from  t  =  2  to  t  =  2  +  At  is 

h{2  +  At)  —  h(2)  =  (100 -4.9(2  + At)2)  -80.4 

-  100  -  4.9(4  +  4At  +  At2)  -  80.4 
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-  100  -  19.6  -  19.6Af  -  4.9Ar  -  80.4 
=  -19.6At-4.9Ar 
=  — At(19.6  +  4.9At) 

The  average  speed  during  this  time  interval  is  then 

M1M  +  4.9*) 

At 


When  At  is  very  small,  this  is  very  close  to  19.6.  Indeed,  liniAx->.o(19.6  +  4.9At)  =  19.6.  So  the  exact 
speed  at  t  =  2  is  19.6  meters  per  second. 

At  this  stage  we  need  to  make  a  distinction  between  speed  and  velocity.  Velocity  is  signed  speed,  that 
is,  speed  with  a  direction  indicated  by  a  sign  (positive  or  negative).  Our  algebra  above  actually  told  us  that 
the  instantaneous  velocity  of  the  ball  at  t  —  2  is  —19.6  meters  per  second.  The  number  19.6  is  the  speed 
and  the  negative  sign  indicates  that  the  motion  is  directed  downwards  (the  direction  of  decreasing  height). 

In  the  language  of  the  previous  section,  we  might  have  started  with  f(x)  —  1 00  —  4.9x2  and  asked  for 
the  slope  of  the  tangent  line  at  x  —  2.  We  would  have  answered  that  question  by  computing 


Um  /(2  +  Ax)  — /(2) 
Ax— S’O  Ax 


-19.6Ajc-4.9Ax2 

lim  - - - 

Ax— >0  Ax 


— 19.6  — 4.9Ax  =  19.6 


The  algebra  is  the  same.  Thus,  the  velocity  of  the  ball  is  the  value  of  the  derivative  of  a  certain  function, 
namely,  of  the  function  that  gives  the  position  of  the  ball. 

The  upshot  is  that  this  problem,  finding  the  velocity  of  the  ball,  is  exactly  the  same  problem  mathemat¬ 
ically  as  finding  the  slope  of  a  curve.  This  may  already  be  enough  evidence  to  convince  you  that  whenever 
some  quantity  is  changing  (the  height  of  a  curve  or  the  height  of  a  ball  or  the  size  of  the  economy  or 
the  distance  of  a  space  probe  from  earth  or  the  population  of  the  world)  the  rate  at  which  the  quantity  is 
changing  can,  in  principle,  be  computed  in  exactly  the  same  way,  by  finding  a  derivative. 


Exercises  for  Section  4.1 


Exercise  4.1.1  Draw  the  graph  of  the  function  y  —  /(x)  =  \J  169  — x2  between  x  —  0  and  x  —  13.  Find 
the  slope  Ay  /  Ax  of  the  chord  between  the  points  of  the  circle  lying  over  (a)  x  —  12  and  x  —  13,  (b)  x—  12 
and  x—  12.1,  (c)  x  —  12  and  x  =  12.01,  (d)  x  —  12  andx  —  12.001.  Now  use  the  geometry  of  tangent  lines 
on  a  circle  to  find  (e)  the  exact  value  of  the  derivative  f'(  12).  Your  answers  to  (a)-(d)  should  be  getting 
closer  and  closer  to  your  answer  to  (e). 

Exercise  4.1.2  Use  geometry  to  find  the  derivative  f  (x)  of  the  function  /(x)  =  sj 625  —  x2  in  the  text  for 
each  of  the  following  x:  (a)  20,  (b)  24,  (c)  —7,  (d)  —15.  Draw  a  graph  of  the  upper  semicircle,  and  draw 
the  tangent  line  at  each  of  these  four  points. 

Exercise  4.1.3  Draw  the  graph  of  the  function  y  —  /(x)  —  \/x  between  x  =1/2  and  x  —  4.  Find  the 
slope  of  the  chord  between  (a)  x  —  3  and  x  —  3.1,  (b)  x  —  3  and  x  —  3.01,  (c)  x  —  3  and  x  —  3.001.  Now 
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use  algebra  to  find  a  simple  formula  for  the  slope  of  the  chord  between  (3,/(3))  and  (3  +  Ax,/(3  +  Ax)). 
Determine  what  happens  when  Ax  approaches  0.  In  your  graph  ofy  —  l/x,  draw  the  straight  line  through 
the  point  (3, 1/3)  whose  slope  is  this  limiting  value  of  the  difference  quotient  as  A x  approaches  0. 

Exercise  4.1.4  Find  an  algebraic  expression  for  the  difference  quotient  (/(l  +  Ax)  — /(1))/Ax  when 
fix)  —  x2  —  (l/x).  Simplify  the  expression  as  much  as  possible.  Then  determine  what  happens  as  Ax 
approaches  0.  That  value  is  f'(  1). 

Exercise  4.1.5  Draw  the  graph  of  y  —  /(x)  =  x3  between  x  =  0  and  x  —  1.5.  Find  the  slope  of  the 
chord  between  (a)  x  —  1  and  x  =  1.1,  (b)  x  —  1  and  x  —  1.001,  (c)  x  =  1  and  x  =  1.00001.  Then  use 
algebra  to  find  a  simple  formula  for  the  slope  of  the  chord  between  1  and  1  +  Ax.  (Use  the  expansion 
(A  +  Bp  =  A  +  3 A  B  +  3 AB  +  IT  .)  Determine  what  happens  as  Ax  approaches  0,  and  in  your  graph  of 
y  —  x3  draw  the  straight  line  through  the  point  (1,1)  whose  slope  is  equal  to  the  value  you  just  found. 

Exercise  4.1.6  Find  an  algebraic  expression  for  the  difference  quotient  (/(x+Ax)  — /(x))/Ax  when  /(x)  = 
mx  +  b.  Simplify  the  expression  as  much  as  possible.  Then  determine  what  happens  as  Ax  approaches  0. 
That  value  is  f  (x) . 

Exercise  4.1.7  Sketch  the  unit  circle.  Discuss  the  behavior  of  the  slope  of  the  tangent  line  at  various 
angles  around  the  circle.  Which  trigonometric  function  gives  the  slope  of  the  tangent  line  at  an  angle  0  ? 
Why?  Hint:  think  in  terms  of  ratios  of  sides  of  triangles. 

Exercise  4.1.8  Sketch  the  parabola  y  =  x.  For  what  values  ofx  on  the  parabola  is  the  slope  of  the  tangent 
line  positive?  Negative?  What  do  you  notice  about  the  graph  at  the  point(s)  where  the  sign  of  the  slope 
changes  from  positive  to  negative  and  vice  versa? 

Exercise  4.1.9  An  object  is  traveling  in  a  straight  line  so  that  its  position  (that  is,  distance  from  some 
fixed  point)  is  given  by  this  table: 


time  (seconds) 

0 

1 

2 

3 

distance  (meters) 

0 

10 

25 

60 

Find  the  average  speed  of  the  object  during  the  following  time  intervals:  [0,1],  [0,2],  [0,3],  [1,2],  [1,3], 
[2,3].  If  you  had  to  guess  the  speed  at  t  =  2  just  on  the  basis  of  these,  what  would  you  guess? 

Exercise  4.1.10  Let  y  —  f{t)  —  t2,  where  t  is  the  time  in  seconds  and  y  is  the  distance  in  meters  that  an 
object  falls  on  a  certain  airless  planet.  Draw  a  graph  of  this  function  between  t  =  0  and  t  —  3.  Make  a 
table  of  the  average  speed  of  the  falling  object  between  (a)  2  sec  and  3  sec,  (b)  2  sec  and  2.1  sec,  (c)  2 
sec  and  2.01  sec,  (d)  2  sec  and  2.001  sec.  Then  use  algebra  to  find  a  simple  formula  for  the  average  speed 
between  time  2  and  time  2  + At.  (If  you  substitute  At  —  1,  0.1,  0.01,  0.001  in  this  formula  you  should  again 
get  the  answers  to  parts  (a)-(d).)  Next,  in  your  formula  for  average  speed  (which  should  be  in  simplified 
form)  determine  what  happens  as  At  approaches  zero.  This  is  the  instantaneous  speed.  Fincdly,  in  your 
graph  ofy  =  t2  draw  the  straight  line  through  the  point  (2,4)  whose  slope  is  the  instantaneous  velocity 
you  just  computed;  it  should  of  course  be  the  tangent  line. 

Exercise  4.1.11  If  an  object  is  dropped  from  an  80-meter  high  window,  its  height  y  above  the  ground  at 
time  t  seconds  is  given  by  the  formula  y  —  f(t )  =  80  —  4.9t2.  (Here  we  are  neglecting  cur  resistance;  the 


4.2.  The  Derivative  Function  121 


graph  of  this  function  was  shown  in  figure  1.1.)  Find  the  average  velocity  of  the  falling  object  between  (a) 
1  sec  and  1.1  sec,  (b)  1  sec  and  1.01  sec,  (c)  1  sec  and  1.001  sec.  Now  use  algebra  to  find  a  simple  formula 
for  the  average  velocity  of  the  falling  object  between  1  sec  and  1  +  At  sec.  Determine  what  happens  to 
this  average  velocity  as  At  approaches  0.  That  is  the  instantaneous  velocity  at  time  t  —  1  second  ( it  will 
be  negative,  because  the  object  is  falling). 


4.2  The  Derivative  Function 


In  Section  4.1,  we  have  seen  how  to  create,  or  derive,  a  new  function  /'(x)  from  a  function  fix),  and 
that  this  new  function  carries  important  information.  In  one  example  we  saw  that  f'{x )  tells  us  how  steep 
the  graph  of  f(x)  is;  in  another  we  saw  that  f (x)  tells  us  the  velocity  of  an  object  if  f(x)  tells  us  the 
position  of  the  object  at  time  x.  As  we  said  earlier,  this  same  mathematical  idea  is  useful  whenever  f(x) 
represents  some  changing  quantity  and  we  want  to  know  something  about  how  it  changes,  or  roughly,  the 
“rate”  at  which  it  changes.  Most  functions  encountered  in  practice  are  built  up  from  a  small  collection  of 
“primitive”  functions  in  a  few  simple  ways,  for  example,  by  adding  or  multiplying  functions  together  to 
get  new,  more  complicated  functions.  To  make  good  use  of  the  information  provided  by  /'(x)  we  need  to 
be  able  to  compute  it  for  a  variety  of  such  functions. 

We  will  begin  to  use  different  notations  for  the  derivative  of  a  function.  While  initially  confusing,  each 
is  often  useful  so  it  is  worth  maintaining  multiple  versions  of  the  same  thing. 

Consider  again  the  function  f(x)  —  \J 625  —  x2.  We  have  computed  the  derivative  f  (x)  =  —xf  \J 625  —  x2, 
and  have  already  noted  that  if  we  use  the  alternate  notation  y  —  625  —  x2  then  we  might  write  y  = 

— x/ \J 625  —x2.  Another  notation  is  quite  different,  and  in  time  it  will  become  clear  why  it  is  often  a 
useful  one.  Recall  that  to  compute  the  the  derivative  of  /  we  computed 

J 625  —  (7  +  At)2  —  24 
lim  — - - - . 

Ax 

The  denominator  here  measures  a  distance  in  the  x  direction,  sometimes  called  the  “run”,  and  the  numer¬ 
ator  measures  a  distance  in  the  y  direction,  sometimes  called  the  “rise,”  and  “rise  over  run”  is  the  slope  of 
a  line.  Recall  that  sometimes  such  a  numerator  is  abbreviated  Ay,  exchanging  brevity  for  a  more  detailed 
expression.  So  in  general,  we  define  a  derivative  by  the  following  equation. 
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To  recall  the  form  of  the  limit,  we  sometimes  say  instead  that 


dy 

dx 


lim  % 

Ax^>0  Ax 


In  other  words,  dy/dx  is  another  notation  for  the  derivative,  and  it  reminds  us  that  it  is  related  to  an  actual 
slope  between  two  points.  This  notation  is  called  Leibniz  notation,  after  Gottfried  Leibniz,  who  developed 
the  fundamentals  of  calculus  independently,  at  about  the  same  time  that  Isaac  Newton  did.  Again,  since 
we  often  use  /  and  f(x)  to  mean  the  original  function,  we  sometimes  use  df/dx  and  df(x)/dx  to  refer  to 
the  derivative.  If  the  function  f(x)  is  written  out  in  full  we  often  write  the  last  of  these  something  like  this 

fix)  =  V' 625  —  x2 

dx 

with  the  function  written  to  the  side,  instead  of  trying  to  fit  it  into  the  numerator. 


Solution.  We  compute 


lim 

A/— >0 


Ay 

At 


—  lim 
A/— >0 


(t-y  At)2  —  t2 
At 


f  +  2tAt  +  At2  - 12 

—  lim  - 

A/— >0  At 


2tAt  +  At~ 

=  lim  - 

A/— >0  At 


—  Iim2t  +  At  =  2t. 

A/— >0 

X 

Remember  that  At  is  a  single  quantity,  not  a  “A”  times  a  “t”,  and  so  At 2  is  (At)2  not  A ( / 2 ) .  Doing 
the  same  example  using  the  second  formula  for  the  derivative  with  h  in  place  of  At  gives  the  following. 
Note  that  we  compute  /(t  +  h)  by  substituting  t  +  h  in  place  of  t  everywhere  we  see  t  in  the  expression 
fit),  while  making  no  other  changes  (at  least  initially).  For  example,  if  /(t)  =  t  +  y/(t  +  3)2  —  t  then 
f(t-\-h)  —  (t  +  h.)  +  y/((t  +  h)  +  3)2  —  (t  +  h)  =  t  -\-h-\-  y/(t  +  h  +  3)2  —  t  —  h. 


4.2.  The  Derivative  Function  123 


Solution.  We  compute 


m 


h-t  0  h 


=  lim 

/l— 5-0 


(t  +  h)2-t2 

h 


t~  +  2th  +  h2  —  t2 

=  lim - - - 

/?-*•()  h 


2th +  h2 
—  lim - - - 

/z — 5-0  h 


—  \im2t  +  h  =  2t. 

/z — 5-0 


X 


Solution.  The  computation: 


/ 

y 


Ay->o  Ax 


_j _ l 

lim  * 

Ax — ^0  Ax 

■Y _ ,Y+Ay 

.y(.y+Ay)  y(.y+Ay) 

Ay— >0  Ax 

x—  (.y+Ay) 
y(.y+Ay) 

lim  — — — 

Ay-S-0  Ax 


lim 

Ay— >0 


x  —  x  —  Ax 
x(x  +  Ax)  Ax 


lim 

Ay— >0 


— Ax 

x(x  +  Ax)  Ax 


lim 

Ay— >0 


-1 

x(x  +  Ax) 


-1 


X 


2 


X 

Note:  If  you  happen  to  know  some  “derivative  formulas”  from  an  earlier  course,  for  the  time  being 
you  should  pretend  that  you  do  not  know  them.  In  examples  like  the  ones  above  and  the  exercises  below, 
you  are  required  to  know  how  to  find  the  derivative  formula  starting  from  basic  principles.  We  will  later 
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develop  some  formulas  so  that  we  do  not  always  need  to  do  such  computations,  but  we  will  continue  to 
need  to  know  how  to  do  the  more  involved  computations. 


To  recap,  given  any  function  /  and  any  number  x  in  the  domain  of  /,  we  define  f'(x)  —  lirri/,  ,o  i&ddlCifbf 
wherever  this  limit  exists,  and  we  call  the  number  f'{x )  the  derivative  of  /  at  x.  Geometrically,  f'(x)  is  the 
slope  of  the  tangent  line  to  the  graph  of  /  at  the  point  (x,/(x)).  The  following  symbols  also  represent  the 
derivative: 


,//  \  /  dy  df  d 

nx)  =  y=Tx  =  Tx  =  Txf{x)- 


The  symbol  d/dx  is  called  a  differential  operator  which  means  to  take  the  derivative  of  the  function  f(x) 
with  respect  to  the  variable  x. 

In  the  next  example  we  emphasize  the  geometrical  interpretation  of  derivative. 


r  i 

Example  4.6:  Geometrical  Interpretation  of  Derivative 

Consider  the  function  f(x)  given  by  the  graph  be 
derivative  of  f(x)  by  analyzing  slopes  of  tangent  1 

y 

low.  Verify  that  the  graph  of  f(x)  is  indeed  the 
ines  to  the  graph  at  different  points. 

f(x) 

V\ 

Solution.  We  must  think  about  the  tangent  lines  to  the  graph  of  /,  because  the  slopes  of  these  lines  are  the 
values  of  /'( x). 

We  start  by  checking  the  graph  of  /  for  horizontal  tangent  lines,  since  horizontal  lines  have  a  slope  of  0. 
We  find  that  the  tangent  line  is  horizontal  at  the  points  where  x  has  the  values  -1.9  and  1.8  (approximately). 
At  each  of  these  values  of  x,  we  must  have  f'(x)  —  0,  which  means  that  the  graph  of  f  has  an  x- intercept 
(a  point  where  the  graph  intersects  the  x-axis). 

Note  that  horizontal  tangent  lines  have  a  slope  of  zero  and  these  occur  approximately  at  the  points 
(—1.9,  —3.2)  and  (1.8, 3.2)  of  the  graph.  Therefore  f'(x)  will  cross  the  x-axis  when  x  =  — 1.9  andx  =1.8. 

Analyzing  the  slope  of  the  tangent  line  of  /(x)  at  x  =  0  gives  approximately  3.0,  thus,  /'( 0)  =  3.0. 
Similarly,  analyzing  the  slope  of  the  tangent  lines  of  /(x)  at  x  =  1  and  x  =  —  1  give  approximately  2.0  for 
both,  thus,  /'(l)  —  /'(  — 1)  =  2.0.  X 

In  the  next  example  we  verify  that  the  slope  of  a  straight  line  is  m. 
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Example  4.7:  Derivative  of  a  Linear  Function 


Let  m,  b  be  any  two  real  numbers.  Determine  f'(x)  if  f(x)  =  mx  +  b. 


Solution.  By  the  definition  of  derivative  (using  h  in  place  of  Ax)  we  have, 

f'(x)  =  lim  +  -/(*)  =  lim  (m(x  +  h)+b)-{mx  +  b) 

1  '  h^O  h  h^O  h 


—  lim  — -  =  lim  m  =  m. 

/?— >0  h  /i — >-0 

This  is  not  surprising.  We  know  that  /'(x)  always  represents  the  slope  of  a  tangent  line  to  the  graph  of 
/.  In  this  example,  since  the  graph  of  /  is  a  straight  line  y  =  mx  +  b  already,  every  tangent  line  is  the  same 
line  y  =  mx  +  b.  Since  this  line  has  a  slope  of  m,  we  must  have  f'(x )  =  m.  A 


4.2.1.  Differentiable 


Now  that  we  have  introduced  the  derivative  of  a  function  at  a  point,  we  can  begin  to  use  the  adjective 

differentiable. 


Definition  4.8:  Differentiable  at  a  Point 


A  function  f  is  differentiable  at  point  a  iff'  {a)  exists. 


Definition  4.9:  Differentiable  on  an  Interval 


A  function  f  is  differentiable  on  an  open  interval  if  it  is  differentiable  at  every  point  in  the  interval. 


Sometimes  one  encounters  a  point  in  the  domain  of  a  function  y  —  f(x)  where  there  is  no  derivative, 
because  there  is  no  tangent  line.  In  order  for  the  notion  of  the  tangent  line  at  a  point  to  make  sense,  the 
curve  must  be  “smooth”  at  that  point.  This  means  that  if  you  imagine  a  particle  traveling  at  some  steady 
speed  along  the  curve,  then  the  particle  does  not  experience  an  abrupt  change  of  direction.  There  are 
two  types  of  situations  you  should  be  aware  of — corners  and  cusps — where  there’s  a  sudden  change  of 
direction  and  hence  no  derivative. 


Example  4.10:  Derivative  of  the  Absolute  Value 


Discuss  the  derivative  of  the  absolute  value  function  y  —  f(x)  —  \x 


Solution.  If  x  is  positive,  then  this  is  the  function  y  —  x,  whose  derivative  is  the  constant  1.  (Recall  that 
when  y  =  /(x)  =  mx  +  b,  the  derivative  is  the  slope  m.)  If  x  is  negative,  then  we’re  dealing  with  the 
function  y  =  — x,  whose  derivative  is  the  constant  —1.  If  x  =  0,  then  the  function  has  a  comer,  i.e.,  there 
is  no  tangent  line.  A  tangent  line  would  have  to  point  in  the  direction  of  the  curve — but  there  are  two 
directions  of  the  curve  that  come  together  at  the  origin. 
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We  can  summarize  this  as 


1,  ifx>0, 
—  1,  ifx<0, 
undefined,  if  x  —  0. 


In  particular,  the  absolute  value  function  /(x)  —  |x|  is  not  differentiable  at  x  =  0. 


We  note  that  the  following  theorem  can  be  proved  using  limits. 


X 


Proof.  Suppose  that  /  is  differentiable  at  a.  That  is, 


/(«) 


lim 

/!->  0 


f(a  +  h)-f(a) 
h 


exists.  At  this  stage,  we  find  it  convenient  to  write  this  limit  in  an  alternative  form  so  that  its  connection 
with  continuity  can  become  more  easily  seen.  If  we  let  x  —  a  +  h,  then  h—x  —  a.  Furthermore,  h—>  0  is 
equivalent  to  x  — *  a.  So, 


/'(«) 


lim 

x— >a 


f{x)-f{a) 


(This  alternative  formulation  of  the  derivative  is  also  standard.  We  will  use  it  whenever  we  find  it  conve¬ 
nient  to  do  so.  You  should  get  familiar  with  it.)  Continuity  at  a  can  now  be  proved  as  follows: 


lim/  (x) 

x — ya 


,im(7M  zm.(x 

x—ta  \  x  —  a 


«)+/(«) 


lim  .  Hm  (x  —  a)  +  lim  /  (a) 

x-^ta  X  —  CL  x— >a  x-^ta 

f  (a)  •  (a  —  a)  +/  (a) 

/(«)• 


X 

However,  if  /  is  continuous  at  a  it  is  not  necessarily  true  that  /  is  differentiable  at  a.  For  example,  it 
was  shown  that  /(x)  =  |x|  is  not  differentiable  at  x  =  0  in  the  previous  example,  however,  one  can  observe 
that  /(x)  =  |x|  is  continuous  everywhere. 
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Example  4.12:  Derivative  of  y  —  x2/3 


Discuss  the  derivative  of  the  function  y  —  x2/3,  shown  in  Figure  4.2. 


Solution.  We  will  later  see  how  to  compute  this  derivative;  for  now  we  use  the  fact  that  y  —  (2/3)x~  l/3. 
Visually  this  looks  much  like  the  absolute  value  function,  but  it  technically  has  a  cusp,  not  a  comer.  The 
absolute  value  function  has  no  tangent  line  at  0  because  there  are  (at  least)  two  obvious  contenders — the 
tangent  line  of  the  left  side  of  the  curve  and  the  tangent  line  of  the  right  side.  The  function  y  —  x2//3  does 
not  have  a  tangent  line  at  0,  but  unlike  the  absolute  value  function  it  can  be  said  to  have  a  single  direction: 
as  we  approach  0  from  either  side  the  tangent  line  becomes  closer  and  closer  to  a  vertical  line;  the  curve 
is  vertical  at  0.  But  as  before,  if  you  imagine  traveling  along  the  curve,  an  abrupt  change  in  direction  is 
required  at  0:  a  full  180  degree  turn.  & 


Figure  4.2:  A  cusp  on  x2/3. 


In  practice  we  won’t  worry  much  about  the  distinction  between  these  examples;  in  both  cases  the 
function  has  a  “sharp  point”  where  there  is  no  tangent  line  and  no  derivative. 

4.2.2.  Second  and  Other  Derivatives 


If  /  is  a  differentiable  function  then  its  derivative  f  is  also  a  function  and  so  we  can  take  the  derivative  of 
/'.  The  new  function,  denoted  by  f",  is  called  the  second  derivative  of  /,  since  it  is  the  derivative  of  the 
derivative  of  /. 

The  following  symbols  represent  the  second  derivative: 


d  f dy\ 
dx  \  dx  J 


We  can  continue  this  process  to  get  the  third  derivative  of  /. 

In  general,  the  nth  derivative  of  /  is  denoted  by  /(")  and  is  obtained  from  /  by  differentiating  n  times. 
If  y  —  /(x),  then  we  write: 

Jn)  _  An)  r  \  _  dy 
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4.2.3.  Velocities 


Suppose  f(t)  is  a  position  function  of  an  object,  representing  the  displacement  of  the  object  from  the 
origin  at  time  t.  In  terms  of  derivatives,  the  velocity  of  an  object  is: 

v(a)  =/(«) 

The  change  of  velocity  with  respect  to  time  is  called  the  acceleration  and  can  be  found  as  follows: 

a(t)=v\t)=f(t). 

Acceleration  is  the  derivative  of  the  velocity  function  and  the  second  derivative  of  the  position  function. 


Example  4.13:  Position,  Velocity  and  Acceleration 


Suppose  the  position  function  of  an  object  is  f(t )  =  l2  metres  at  t  seconds.  Find  the  velocity  and 
acceleration  of  the  object  at  time  t  —  Is. 


Solution.  By  the  definition  of  velocity  and  acceleration  we  need  to  compute  f'(t)  and  f"{t).  Using  the 
definition  of  derivative,  we  have, 


fit) 


lim 

h-v  0 


(t  +  h)2-t2 
h 


2  th  +  h2 

lim - - - 

h->0  h 


lim  (2  t  +  h)  —  2 1. 
h—>  0 


Therefore,  v(t)  =  f{t)  —  It.  Thus,  the  velocity  at  time  t  —  1  is  v(l)  =  2  m/s.  We  now  have  that  the 
acceleration  at  time  t  is: 


Therefore,  a(t) 


,  .  ...  .  2(t  +  h)  —  2t  2 h 

ait )  =  f  it)  —  lim - - - =  lim  —  =  2. 

h — h  h^o  h 


2.  Substituting  t  —  1  into  the  function  ail)  gives  a{\)  —  2  m/s2. 


X 


Exercises  for  Section  4.2 


Exercise  4.2.1  Find  the  derivatives  of  the  following  functions. 

(a)  y  —  f(x)  =  \J  169  —x2 
ib)  y  —  fit)  —  80  — 4.9t2 

(c)  y  =  f{x)  =x2-(l/x) 

(d)  y  —  f{x)  —  ax2  +  bx  +  c,  where  a,  b,  and  c  are  constants. 

(e)  y  =  /(*)=* 3 
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(f)  y  =  f(x)  =  2/V2x+  1 

(g)  y  =  g(t )  =  (2t  —  1)/ (t  +  2) 

Exercise  4.2.2  Shown  is  the  graph  of  a  function  fix).  Sketch  the  graph  off'  (x)  by  estimating  the  deriva¬ 
tive  at  a  number  of  points  in  the  interval:  estimate  the  derivative  at  regular  intervals  from  one  end  of  the 
interval  to  the  other,  and  cdso  at  “special”  points,  as  when  the  derivative  is  zero.  Make  sure  you  indicate 
any  places  where  the  derivative  does  not  exist. 


Exercise  4.2.3  Shown  is  the  graph  of  a  function  fix).  Sketch  the  graph  off'  (x)  by  estimating  the  deriva¬ 
tive  at  a  number  of  points  in  the  interval:  estimate  the  derivative  at  regular  intervals  from  one  end  of  the 
interval  to  the  other,  and  cdso  at  “special”  points,  as  when  the  derivative  is  zero.  Make  sure  you  indicate 
any  places  where  the  derivative  does  not  exist. 
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—  5  —  x  —  3x  at  the  point  x  — 2 
has  a  horizontal  tangent  line  at 

4.3  Derivative  Rules 


Exercise  4.2.4  Find  an  equation  for  the  tangent  line  to  the  graph  of  f(x) 


Exercise  4.2.5  Find  a  value  for  a  so  that  the  graph  of  f(x)  —  x2  +  ax  —  3 
x  =  4. 


Using  the  definition  of  the  derivative  of  a  function  is  quite  tedious.  In  this  section  we  introduce  a  number 
of  different  shortcuts  that  can  be  used  to  compute  the  derivative.  Recall  that  the  definition  of  derivative  is: 

Given  any  number  x  for  which  the  limit 


/M 


]im  f{x  +  h)-f(x) 
/z — s-o  h 


exists,  we  assign  to  x  the  number  /'(x). 

Next,  we  give  some  basic  derivative  rules  for  finding  derivatives  without  having  to  use  the  limit  defi¬ 
nition  directly. 


Proof.  Let  /(x)  =  c  be  a  constant  function.  By  the  definition  of  derivative: 


/« 


Hm  f(x  +  h)-f(x) 
h- >0  h 


lim 

/z— >o 


lim  0  =  0. 

/z— >0 


* 


Example  4.15:  Derivative  of  a  Constant  Function 


The  derivative  of  f(x)  —  17  is  f'{x)  =  0  since  the  derivative  of  a  constant  is  0. 


Proof.  We  use  the  formula: 


x"  -  an  =  (x  -  a)  (x”-1  +f’~2a  +  •  •  •  +xa"'2  +  a'7’1) 
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which  can  be  verified  by  mulitplying  out  the  right  side.  Let  /(x)  =  x"  be  a  power  function  for  some 
positive  integer  n.  Then  at  any  number  a  we  have: 


f,,  \  r  f{x)-f{a)  . 

/  (a)  —  lim - =  lim 


x — ya  x  —  a  x — yo  x  —  a  j c— 


=  lim(xn  1  +xn  2 a  H - \-xa'1  2+an  1 )  =  nar‘ 


It  turns  out  that  the  Power  Rule  holds  for  any  real  number  n  (though  it  is  a  bit  more  difficult  to  prove). 


Example  4.18:  Derivative  of  a  Power  Function 


By  the  power  rule,  the  derivative  of  g(x)  =  x4  is  g'(x)  =  4x3. 


Proof.  For  convenience  let  g(x)  =  c/(x).  Then: 


s'M  =  lim  =  lim  Cf(x  +  h)-cf(x) 

h^O  h  h->  o  h 


—  lim  c 

/j— >0 


/(x  +  fi)  -/(x) 
h 


=  +  =cf  W. 

/z-^0  /?  W 


where  c  can  be  moved  in  front  of  the  limit  by  the  Limit  Rules. 
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Proof.  For  convenience  let  r(x)  —  f(x)  ±g(x).  Then: 


r\x) 


r(x  +  h)  —  r(x) 
h->  0  h 


lim  +  +  ~  [/(*)  ±g(*)] 

h—>  o  h 


lim 

/?->  0 


f(x  +  h)-f(x)  g(x  +  h)-g(x) 


h 


h 


lim  f(x  +  h)-f(x)  ±  lim  g(*  +  *Q-g(*) 

h—t  o  h  /i — 5-0  h 


=  A*)+g\x) 


* 


Proof.  For  convenience  let  r(jt)  =  fix)  ■  g(x).  As  in  the  previous  proof,  we  want  to  separate  the  functions 
/  and  g.  The  trick  is  to  add  and  subtract  f(x  +  h)g(x)  in  the  numerator.  Then: 

r'(x)  =  lim  f(x  +  h^8(x  +  hAf(x  +  h)8(x)  +f(x  +  h)g(x)-f{x)g(x) 

1  '  h^O  h 


—  lim 

/z— >0 


f^+hf±±hzm+g{x)fA*+h)  -/m 


h 


h 


rt  A  ,■  g(x  +  h)  —g(x)  fix  +  h)  —  fix) 

=  lim  fix  +  h)  ■  lim  - - -  +  lim  g(jc)  •  lim  - - 

/z->0  /z— >0  h  h-+  0  /7— >0  /l 


=  /(^'W+^W/W 


4 
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Solution.  One  way  to  do  this  question  is  to  expand  the  expression.  Alternatively,  we  use  the  product  rule 
with  /(x)  —  3x  —  1  and  g(x)  =  2x  +  3.  Note  that  f'(x)  =  3  and  g'(x)  —  2,  so, 

h'(x)  =  (3)  •  (2x  +  3)  +  (3x  —  1)  •  (2)  =  6x  +  9  +  6x  — 2  =  12x  +  7. 


X 


Theorem  4.25:  The  Quotient  Rule 

Iff  and  g  are  both  differentiabh 

d 

dx 

2  fund 

7W" 

g(x). 

Ions,  then 

_  s(x)ic\f(x)}  ~f(x)ic [$(*)] 

feW]2 

Proof.  The  proof  is  similar  to  the  previous  proof  but  the  trick  is  to  add  and  subtract  the  term  f(x)g(x)  in 
the  numerator.  We  omit  the  details.  X 


Solution.  By  the  quotient  rule  (using  /(x)  =  3x  —  1  and  g(x)  —  2x  +  3)  we  have: 

,/M  &(3x-1)-(2x+3)-(3x-1)-£(2x+3) 

W~  (ii+3  y- 

_  3(2x  +  3)  —  (3x—  1)(2)  _  11 

(2x  +  3)2  (2x  +  3)2 


X 


Solution.  We  must  differentiate  /(x)  twice: 


f(x)  —  15x2  +  6x, 
f"  (x)  =  30x  +  6. 
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Exercises  for  Section  4.3 


Exercise  4.3.1  Find  the  derivatives  of  the  following  functions. 


(a) 

x100 

(f) 

X-W 

(k) 

(x+  l)(x2  +  2x  —  3) 

(b) 

x-100 

(8) 

5x3  +  12x2  —  15 

(D 

x3(x3  —  5x4- 10) 

(c) 

1 

(h) 

— 4x5  +  3x2  —  5/x2 

(m) 

(x2  +  5x  —  3)(x5) 

(d) 

X* 

(i) 

5(-3x2  +  5x+1) 

(n) 

(x2  +  5x  —  3)(x-5) 

(e) 

X3/4 

(j) 

(x+  1)(x2  +  2x  —  3) 

(o) 

(5x3  +  12x2  —  15)-1 

Exercise  4.3.2  Find  an  equation  for  the  tangent  line  to  f(x)  =  x3/4  —  1  jx  at  x  —  —2. 

Exercise  4.3.3  Find  an  equation  for  the  tangent  line  to  fix)  =  3x2  —  tt3  at  x  =  4. 

Exercise  4.3.4  Suppose  the  position  of  an  object  at  time  t  is  given  by  f(t )  =  — 49t2/ 1 0  +  5/  +  10.  Find  a 
function  giving  the  speed  of  the  object  at  time  t.  The  acceleration  of  an  object  is  the  rate  at  which  its  speed 
is  changing,  which  means  it  is  given  by  the  derivative  of  the  speed  function.  Find  the  acceleration  of  the 
object  at  time  t. 

Exercise  4.3.5  Let  f(x)  —  x3  and  c  =  3.  Sketch  the  graphs  off,  cf,  f,  and  ( cf )'  on  the  same  diagram. 


n 

Exercise  4.3.6  The  general  polynomial  P  of  degree  n  in  the  variable  x  has  the  form  P(x)  —  ^  a^x  = 

k=  o 


ao  +  a\x  + . . .  4-  anxM.  What  is  the  derivative  ( with  respect  to  x)  ofP? 


Exercise  4.3.7 


Find  a  cubic  polynomial  whose  graph  has  horizontal  tangents  at  (—2,5)  and  (2,3). 


Exercise  4.3.8 


Prove  that 


d_ 

dx 


(c/(x))  =  cf  [x )  using  the  definition  of  the  derivative. 


Exercise  4.3.9  Suppose  that  f  and  g  are  differentiable  at  x.  Show  that  f  —  g  is  differentiable  at  x  using 
the  two  linearity  properties  from  this  section. 


Exercise  4.3.10  Use  the  product  rule  to  compute  the  derivative  off(x)  —  (2x  —  3)2.  Sketch  the  function. 
Find  an  equation  of  the  tangent  line  to  the  curve  at  x—  2.  Sketch  the  tangent  line  at  x  —  2. 


Exercise  4.3.11  Suppose  that  f,  g,  and  h  are  differentiable  functions.  Show  that  ( fgh)'(x )  =  f(x)g(x)h(x)  + 
f(x)g'(x)h(x)  +f(x)g(x)h'(x). 


x3  —  5x+  10 


Exercise  4.3.12  Compute  the  derivative  of 
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Exercise  4.3.13  Compute  the  derivative  of 
Exercise  4.3.14  Compute  the  derivative  of 

Exercise  4.3.15  Compute  the  derivative  of 


x2  T  5x  —  3 


x5  —  6x 3  +  3x2  —  7  x  +  1 
x 


Vx-625' 

Vx-5 

y-20  ‘ 


Exercise  4.3.16  Find  an  equation  for  the  tangent  line  to  f{x)  —  (x2  —  4)  /  (5  —  x)  at  x=  3. 

Exercise  4.3.17  Find  an  equation  for  the  tangent  line  to  f(x )  =  (x  —  2)/(x3  +4x  —  1)  at  x  —  1. 

Exercise  4.3.18  If  f'{ 4)  =  5,  g\ 4)  =  12,  (fg)( 4)  =  /(4)g(4)  =  2,  and  g(4)  =  6,  compute  /( 4)  and  —  — 

(XX  ^ 

at  4. 


4.4  Derivative  Rules  for  Trigonometric  Functions 


We  next  look  at  the  derivative  of  the  sine  function.  In  order  to  prove  the  derivative  formula  for  sine,  we 
recall  two  limit  computations  from  earlier: 


lim 

jc— >0 


sinx 

x 


and 


cosx— 1 

lim - 

x — ^0  X 


=  0, 


and  the  double  angle  formula 


sin  (A  +  B)  —  sinAcos5  +  sinficosA. 
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Proof.  Let  /(x)  =  sinx.  Using  the  definition  of  derivative  we  have: 


/« 


lim 

/t->  0 


f(x  +  h)  -fix) 
h 


sin(x  +  h)  —  sinx 

=  lim - 

h^O  h 


sinxcos/i-b  cosx  sin /i  —  sinx 

=  lim - 

0  ll 


h  —  1  cin  h 

=  lim  sinx -lim — - - b  lim  cos  x-  lim  - - 

h — ^0  /j-t’O  h  h— >0  h— >0  h 


=  sinx- 0  + cosx- 1 


=  cosx 


since 


sinx 
lim - 

x — ^0  X 


.  cosx— 1 

1  and  lim - =  0. 

x — ^0  X 


X 


A  formula  for  the  derivative  of  the  cosine  function  can  be  found  in  a  similar  fashion: 

d  ,  \ 

—  (cosx)  =  —  sinx. 
dx 

Using  the  quotient  rule  we  get  formulas  for  the  remaining  trigonometric  ratios.  To  summarize,  here 
are  the  derivatives  of  the  six  trigonometric  functions: 


-^-(sin(x))  =  cos(x) 

(tan(x))  =  sec2(x) 

-y-  (csc(x))  =  — csc(x)cot(x) 

CvX/ 

——  (cos(x))  =  — sin(x) 

CIX 

-^-(cot(x))  =  — csc2(x) 

CIX 

(sec(x))  =  sec(x)tan(x) 

CIX 

Example  4.29:  Derivative  of  Product  of  Trigonometric  Functions 


Find  the  derivative  of  f(x)  —  sinxtanx. 


Solution.  Using  the  Product  Rule  we  obtain 

f'{x)  —  cosxtanx  +  sinxsec2x. 
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Exercises  for  Section  4.4 


Exercise  4.4.1  Find  the  derivatives  of  the  following  functions. 

(a)  sinxcosx  (b)  cotx  (c)  cscx  — xtanx 

Exercise  4.4.2  Find  the  points  on  the  curx’e  y  =  x  +  2cosx  that  have  a  horizontal  tangent  line. 


4.5  The  Chain  Rule 


Let  h(x)  —  \/ 625  —  x2.  The  rules  stated  previously  do  not  allow  us  to  find  h'(x).  However,  h(x)  is  a 
composition  of  two  functions.  Let  /(x)  =  yfx  and  g(x)  =  625  —  x2.  Then  we  see  that 

h(x)  =  (fog)(x). 

From  our  rules  we  know  that  f  (x)  =  ^x-1/2  and  gfx)  —  —  2x,  thus  it  would  be  convenient  to  have  a  rule 
which  allows  us  to  differentiate  f  o  g  in  terms  of  f  and  g' .  This  gives  rise  to  the  chain  rule. 


The  Chain  Rule 


If  g  is  differentiable  at  x  and  /  is  differentiable  at  g(x),  then  the  composite  function  h  —  fog  [recall 
fog  is  defined  as  f(g(x))\  is  differentiable  atx  and  h'(x)  is  given  by: 

h'{x)  =f(g{x))-g'{x). 


The  chain  rule  has  a  particularly  simple  expression  if  we  use  the  Leibniz  notation  for  the  derivative. 
The  quantity  /'(g(x))  is  the  derivative  of  /  with  x  replaced  by  g;  this  can  be  written  df/dg.  As  usual, 
g'(x)  =  dg/dx.  Then  the  chain  rule  becomes 

df_  _  d£d^ 

dx  dg  dx 

This  looks  like  trivial  arithmetic,  but  it  is  not:  dg/dx  is  not  a  fraction,  that  is,  not  literal  division,  but  a 
single  symbol  that  means  g'{x).  Nevertheless,  it  turns  out  that  what  looks  like  trivial  arithmetic,  and  is 
therefore  easy  to  remember,  is  really  true. 

It  will  take  a  bit  of  practice  to  make  the  use  of  the  chain  rule  come  naturally — it  is  more  complicated 
than  the  earlier  differentiation  rules  we  have  seen. 
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Solution.  We  already  know  that  the  answer  is  —xj  yj 625  —x2,  computed  directly  from  the  limit.  In  the 
context  of  the  chain  rule,  we  have  /(x)  —  y/x,  g(x)  =  625  —x2.  We  know  that  f'(x)  —  (l/2)x_1//2,  so 
f'(g(x))  —  (1/2)  (625  —x2)-1/2.  Note  that  this  is  a  two  step  computation:  first  compute  f'(x),  then  replace 
x  by  g(x).  Since  g'(x)  —  —2x  we  have 


i 

2V625-X2 


(~2x) 


—x 

V 625  —  x2 


X 


Solution.  This  is  a  quotient  with  a  constant  numerator,  so  we  could  use  the  quotient  rule,  but  it  is  simpler 
to  use  the  chain  rule.  The  function  is  (625  —  x2)-1/2,  the  composition  of  /(x)  —  x”1/2  and  g(x)  =  625  — x2. 
We  compute  f{x )  =  (  —  l/2)x~3/2  using  the  power  rule,  and  then 

=  2(625 -‘,2)3/2  =  (625  —",2)3/2 

* 

In  practice,  of  course,  you  will  need  to  use  more  than  one  of  the  rules  we  have  developed  to  compute 
the  derivative  of  a  complicated  function. 


Solution.  The  “last”  operation  here  is  division,  so  to  get  started  we  need  to  use  the  quotient  rule  first.  This 
gives 

,,  .  _  (x2  -  1  y xy/ x2  +  1  —  (x2  -  l)(x\/x2  +  1)' 

~  x2(x2  +  l) 

2x2Vx2+T—  (x2  —  l)(x\/x2  +  1)' 
x2  (x2  T  1 ) 


Now  we  need  to  compute  the  derivative  of  x\/ x2  +  1.  This  is  a  product,  so  we  use  the  product  rule: 


—x\/ X2  +  1  =  x—  V*2  +  1  +  V X2  +  1  ■ 
dx  dx 

Finally,  we  use  the  chain  rule: 

±v/JT~l  =  ±(x2  +  i)W2='ixl+iri/2{2x)  = 


dx 


dx 


V x2  + 1 
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And  putting  it  all  together: 


/«  = 


2x2  Vx2  +  1  —  (v2  —  1 )  (xy/x2  +  1/ 


x2(x 2  +  r 


2x2\Zx2  +  1  —  (x2  —  1) 


y/x2  + 1 


+  Vx2  +  l 


x2(x2  + 1) 


This  can  be  simplified  of  course,  but  we  have  done  all  the  calculus,  so  that  only  algebra  is  left. 


Example  4.33:  Chain  of  Composition 


Solution.  Here  we  have  a  more  complicated  chain  of  compositions,  so  we  use  the  chain  rule  twice.  At 


the  outermost  “layer”  we  have  the  function  g(v)  =  1  +  \J  1  +  y/x  plugged  into  f(x)  —  y/x,  so  applying  the 
chain  rule  once  gives 


cl 


1 


^V1  +  V1  +  ^-  2  1  1  + V  1  +  ^  (  1  + V  1  +  \A  I  • 


-1/2 


d 


Now  we  need  the  derivative  of  \J  \  + y/x.  Using  the  chain  rule  again: 


d 


1 


1/2  1-1/2 


+  +  '  2* 


So  the  original  derivative  is 


cl 

dx 


\J\  +  \J\  +  y/x  —  -  ^1  +  \/l  +  vs)  +  '  2X  ^2‘ 

1 

8 y/x\ /l  +  y/x \J  1  +  yf l  +  y/x 


Using  the  chain  rule,  the  power  rule,  and  the  product  rule,  it  is  possible  to  avoid  using  the  quotient  rule 
entirely. 


Example  4.34:  Derivative  of  Quotient  without  Quotient  Rule 


Compute  the  derivative  of  f(x)  — 


x 


c2  +  1 
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Solution.  Write  /(x)  =  x3  (x2  +  1 )  1 ,  then 

f'{x)  =  x3^-(x2  +  1)_1  +  3x2(x2  +  1)~] 
ax 

=  JC3(— 1)(jc2  +  1)-2(2jc)  +  3jc2(jc2  +  l)-1 

=  — 2x4(x2  +  l)-2  +  3x2(x2  +  1)~! 

— 2x4  3x2 

(x2  +  l)2  X2  +  1 

— 2x4  3x2(x2  +  1) 

(x2  +  l)2  (x2  +  l)2 

— 2x4  +  3x4  +  3x2  x4  +  3x2 

(x2  +  l)2  (x2  +  l)2 

Note  that  we  already  had  the  derivative  on  the  second  line;  all  the  rest  is  simplification.  It  is  easier  to  get 
to  this  answer  by  using  the  quotient  rule,  so  there’s  a  trade  off:  more  work  for  fewer  memorized  formulas. 

* 


Exercises  for  Section  4.5 


Find  the  derivatives  of  the  functions.  For  extra  practice,  and  to  check  your  answers,  do  some  of  these  in 
more  than  one  way  if  possible. 

Exercise  4.5.1  x4  —  3x3  +  ( 1  / 2)x2  +  lx  —  n 

Exercise  4.5.2  x3  —  2x2  +  Ay/x 
Exercise  4.5.3  (x2  +  l)3 
Exercise  4.5.4  x\J  169  — x2 
Exercise  4.5.5  (x2  —  4x  +  5 )  \/ 25  —  x2 
Exercise  4.5.6  \J r2  —  x2,  r  is  a  constant 
Exercise  4.5.7  \J  1  4-x4 

Exercise  4.5.8  — . 

\fS--yJx 

Exercise  4.5.9  (1  +  3x)2 
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Exercise  4.5.10 

Exercise  4.5.11 

Exercise  4.5.12 

Exercise  4.5.13 

Exercise  4.5.14 
Exercise  4.5.15 

Exercise  4.5.16 

Exercise  4.5.17 
Exercise  4.5.18 
Exercise  4.5.19 
Exercise  4.5.20 
Exercise  4.5.21 
Exercise  4.5.22 
Exercise  4.5.23 

Exercise  4.5.24 

Exercise  4.5.25 

Exercise  4.5.26 
Exercise  4.5.27 


(x2+x  +  1) 
(1-x) 


V25  —  x2 
x 


sJx3-x2-(\/x) 

100/(100  -x2)3/2 
\/x  +  x3 

J (x2  T  l)2  T  \Jl  +  (x2  +  l)2 

(x+8)5 
(4  —  x)3 
(x2  +  5)3 
(6-2x2)3 
(1-4x3)~2 
5(x  +  1  —  1/x) 

4(2x2  —  x  +  3)-2 
1 

1  +  1/x 
-3 

4x2  —  2x  +  1 
(x2  +  1)(5  —  2x)/2 
(3x2  +  l)(2x  —  4)3 


Exercise  4.5.28 


x  T  1 
x—  1 
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Exercise  4.5.29 


Exercise  4.5.30 


Exercise  4.5.31 


x2-l 

X2  +  1 

(*-!)(*  — 2) 
x  —  3 

2JT1— x~2 


3x  1  —  4x  2 
Exercise  4.5.32  3 (x2  +  1 )  (2x2  —  1 )  (2x  +  3) 

Exercise  4.5.33 


(2x+  l)(x  —  3) 

Exercise  4.5.34  ((2x  +  1)~!  +3)_1 
Exercise  4.5.35  (2x+  l)3(x2  +  l)2 

Exercise  4.5.36  Find  an  equation  for  the  tangent  line  to  /(x)  =  (x  —  2) 1//3/ (x3  +  4x  —  l)2  at  x  —  1. 
Exercise  4.5.37  Find  an  equation  for  the  tangent  line  toy  =  9x~2  at  (3, 1). 

Exercise  4.5.38  Find  an  equation  for  the  tangent  line  to  (x2  —  4x  +  5)  \J  25  —  x2  at  (3, 8). 

(x2+x+  1) 


1  —  x) 


at  (2,  —7). 


Exercise  4.5.39  Find  an  equation  for  the  tangent  line  to 


Exercise  4.5.40  Find  an  equation  for  the  tangent  line  to  y  (x2  +  l)2  +  y  1  +  (x2  +  l)2  at  (1,  y  4  +  x/5). 

Exercise  4.5.41  Let  y  —  f(x)  and  x  =  g(t).  If  g{  1)  =  2,  /( 2)  =  3,  ^(1)  =  4  and  f{ 2)  =  5,  find  the 
derivative  of  fo  g  at  1. 


Exercise  4.5.42  Express  the  derivative  of  g{x)  —  x2/(x2)  in  terms  of  f  and  the  derivative  of  f. 


4.6  Derivatives  of  Exponential  &  Logarithmic  Functions 


As  with  the  sine  function,  we  don’t  know  anything  about  derivatives  that  allows  us  to  compute  the  deriva¬ 
tives  of  the  exponential  and  logarithmic  functions  without  going  back  to  basics.  Let’s  do  a  little  work  with 
the  definition  again: 


ax+Ax-ax 

lim  - - - 

Ax— >-0  Ax 
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=  lim 

Ax— >0 


axa^-ax 


Ax 


=  lim  ax 

Ax— >0 


aAv  - 1 
Ax 


=  ax  lim 

Ax—s>0 


a^-l 

At 


There  are  two  interesting  things  to  note  here:  As  in  the  case  of  the  sine  function  we  are  left  with  a  limit 

that  involves  At  but  not  x,  which  means  that  if  lim  (aAx  —  1)/Ax  exists,  then  it  is  a  constant  number.  This 

Ax — ^0 

means  that  ax  has  a  remarkable  property:  its  derivative  is  a  constant  times  itself. 

We  earlier  remarked  that  the  hardest  limit  we  would  compute  is  limsinx/x  =  1;  we  now  have  a  limit 

x — ?*0 

that  is  just  a  bit  too  hard  to  include  here.  In  fact  the  hard  part  is  to  see  that  lim  ( a ,tXx  —  1  )/At  even  exists — 

Ax — ?*0 

does  this  fraction  really  get  closer  and  closer  to  some  fixed  value?  Yes  it  does,  but  we  will  not  prove  this 
fact. 

We  can  look  at  some  examples.  Consider  (2X  —  l)/x  for  some  small  values  of  x:  1,  0.828427124, 
0.756828460,0.724061864,0.70838051,0.70070877  whenxis  1, 1/2, 1/4, 1/8, 1/16, 1/32,  respectively. 
It  looks  like  this  is  settling  in  around  0.7,  which  turns  out  to  be  true  (but  the  limit  is  not  exactly  0.7). 
Consider  next  (3*-  l)/x:  2,  1.464101616,  1.264296052,  1.177621520,  1.13720773,  1.11768854,  at  the 
same  values  of  x.  It  turns  out  to  be  true  that  in  the  limit  this  is  about  1.1.  Two  examples  don't  establish  a 
pattern,  but  if  you  do  more  examples  you  will  find  that  the  limit  varies  directly  with  the  value  of  a:  bigger 
a ,  bigger  limit;  smaller  a ,  smaller  limit.  As  we  can  already  see,  some  of  these  limits  will  be  less  than  1  and 
some  larger  than  1.  Somewhere  between  a  —  2  and  a  —  3  the  limit  will  be  exactly  1;  the  value  at  which 
this  happens  is  called  e,  so  that 

Ax— >0  At 

As  you  might  guess  from  our  two  examples,  e  is  closer  to  3  than  to  2,  and  in  fact  e  ~  2.718. 

Now  we  see  that  the  function  ex  has  a  truly  remarkable  property: 


_,v+Ax  _  tc 

lim  A 

Ax— >0  At 


lim 

Ax— >0 


exgAx  _  e;x 

At 


e^-l 
lim  C  — - 

Ax->0  At 


eAx  -  1 
lim  - 

Ax->0  At 


=  e 


That  is,  ex  is  its  own  derivative,  or  in  other  words  the  slope  of  ex  is  the  same  as  its  height,  or  the  same  as  its 
second  coordinate:  The  function  /(x)  =  ex  goes  through  the  point  (z,ez)  and  has  slope  e  there,  no  matter 
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what  z  is.  It  is  sometimes  convenient  to  express  the  function  ex  without  an  exponent,  since  complicated 
exponents  can  be  hard  to  read.  In  such  cases  we  use  exp(x),  e.g.,  exp(l  +  x2)  instead  of  e1+x2. 

What  about  the  logarithm  function?  This  too  is  hard,  but  as  the  cosine  function  was  easier  to  do  once 
the  sine  was  done,  so  is  the  logarithm  easier  to  do  now  that  we  know  the  derivative  of  the  exponential 
function.  Let’s  start  with  logex,  which  as  you  probably  know  is  often  abbreviated  1  n  x  and  called  the 
“natural  logarithm”  function. 

Consider  the  relationship  between  the  two  functions,  namely,  that  they  are  inverses,  that  one  “undoes” 
the  other.  Graphically  this  means  that  they  have  the  same  graph  except  that  one  is  “flipped”  or  “reflected” 
through  the  line  y  —  x: 


Figure  4.3:  The  exponential  and  logarithmic  functions. 


This  means  that  the  slopes  of  these  two  functions  are  closely  related  as  well:  For  example,  the  slope  of  ex 
is  e  at  x  =  1;  at  the  corresponding  point  on  the  ln(x)  curve,  the  slope  must  be  \/e,  because  the  “rise”  and 
the  “run”  have  been  interchanged.  Since  the  slope  of  ex  is  e  at  the  point  (l,e),  the  slope  of  ln(x)  is  1/e  at 
the  point  (e,  1). 


Figure  4.4:  The  exponential  and  logarithmic  functions. 


More  generally,  we  know  that  the  slope  of  ex  is  ez  at  the  point  (z,ez),  so  the  slope  of  1  n  (x)  is  1  / ez  at 
(ez,z).  In  other  words,  the  slope  of  1  n  x  is  the  reciprocal  of  the  first  coordinate  at  any  point;  this  means  that 
the  slope  of  lnx  at  (x,  lnx)  is  1  /x.  The  upshot  is: 


d 

dx 


lnx 


1 

x 


We  have  discussed  this  from  the  point  of  view  of  the  graphs,  which  is  easy  to  understand  but  is  not  normally 
considered  a  rigorous  proof — it  is  too  easy  to  be  led  astray  by  pictures  that  seem  reasonable  but  that  miss 
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some  hard  point.  It  is  possible  to  do  this  derivation  without  resorting  to  pictures,  and  indeed  we  will  see 
an  alternate  approach  soon. 

Note  that  lnx  is  defined  only  for  x  >  0.  It  is  sometimes  useful  to  consider  the  function  In  |x|,  a  function 
defined  for  x  ^  0.  When  x  <  0,  In  |x|  =  ln(— x)  and 


d 

dx 


lnlxl 


i  >"(-*) =>1)4 


Thus  whether  x  is  positive  or  negative,  the  derivative  is  the  same. 

What  about  the  functions  ax  and  logax?  We  know  that  the  derivative  of  ax  is  some  constant  times  ax 
itself,  but  what  constant?  Remember  that  “the  logarithm  is  the  exponent”  and  you  will  see  that  a  =  elna. 
Then 


ax  =  (elnaY  =  exlna, 


and  we  can  compute  the  derivative  using  the  chain  rule: 


^-ax  =  ^-(elna)x  =  ^-exlna  =  (In  a)exlna  =  (In  a)ax 
dx  dx  dx 


The  constant  is  simply  In  a.  Likewise  we  can  compute  the  derivative  of  the  logarithm  function  logflx.  Since 


x  —  e 


\nx 


we  can  take  the  logarithm  base  a  of  both  sides  to  get 

logfl0)  =  log>ln*)  =  lnxlogae. 


Then 

TlogiiX=ilog(i,. 

This  is  a  perfectly  good  answer,  but  we  can  improve  it  slightly.  Since 


a 

log«(«) 

1 

1 

In  a 


elna 

loga(elna)  =  lnalogae 
\na\ogae 

log  ae. 


we  can  replace  loga  e  to  get 

d  ,  1 

:rloga*=  w — • 

dx  xma 

You  may  if  you  wish  memorize  the  formulas. 


146  Derivatives 


Because  the  “trick”  a  —  elna  is  often  useful,  and  sometimes  essential,  it  may  be  better  to  remember  the 
trick,  not  the  formula. 


Solution. 


d  _r 
~dx2 


d  ( 
dx 

d 

dx 


An2\x 


..Tin  2 


=  1—X\n2)exln2 
dx 


=  (In2)exlnl  =  2xln2 


Example  4.36:  Derivative  of  Exponential  Function 


Compute  the  derivative  of  f(x)  =  2X~  —  2(t  j. 


Solution. 


d  ryX2  d  cP  |  n  2 

dx  dx 

=  (21n2)xev2ln2 
=  (21n2)x2x2 


X 


Example  4.37:  Power  Rule 


Recall  that  we  have  not  justified  the  power  rule  except  when  the  exponent  is  a  positive  or  negative 
integer. 


Solution.  We  can  use  the  exponential  function  to  take  care  of  other  exponents. 

d  _  d  rinX 

dx  dx 
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=  [  — rlnx  )  e 

ax 


r  Inx 


=  V-J-V 


=  rx 


r—1 


X 


Exercises  for  Section  4.6 

Find  the  derivatives  of  the  functions. 

Exercise  4.6.1  3T 

^  ^  „  sinx 

Exercise  4.6.2  - 

ex 

Exercise  4.6.3  (ex)2 
Exercise  4.6.4  sin(ex) 

Exercise  4.6.5  esinx 
Exercise  4.6.6  xsmx 
Exercise  4.6.7  x3ex 
Exercise  4.6.8  x  +  2X 
Exercise  4.6.9  (l/3)x" 

Exercise  4.6.10  e4x/x 
Exercise  4.6.11  ln(x3  +  3x) 

Exercise  4.6.12  ln(cos(x)) 

Exercise  4.6.13  y/lnfx2)  /x 
Exercise  4.6.14  ln(sec(x) +tan(x)) 

cos(x) 


Exercise  4.6.15  x‘ 
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Exercise  4.6.16  xlnx 
Exercise  4.6.17  ln(ln(3x)) 

Exercise  4.6.18  1+1,n(,3^) 
l+ln(4x) 

Exercise  4.6.19  Find  the  value  of  a  so  that  the  tangent  line  to  y  —  ln(x)  at  x  —  a  is  a  line  through  the 
origin.  Sketch  the  resulting  situation. 

Exercise  4.6.20  If  /(x)  =  ln(x3  +  2)  compute  /'(e1,  3). 


4.7  Implicit  Differentiation 


As  we  have  seen,  there  is  a  close  relationship  between  the  derivatives  of  ex  and  lnx  because  these  functions 
are  inverses.  Rather  than  relying  on  pictures  for  our  understanding,  we  would  like  to  be  able  to  exploit  this 
relationship  computationally.  In  fact  this  technique  can  help  us  find  derivatives  in  many  situations,  not  just 
when  we  seek  the  derivative  of  an  inverse  function. 

We  will  begin  by  illustrating  the  technique  to  find  what  we  already  know,  the  derivative  of  lnx.  Let’s 
write  y  —  1  n  x  and  then  x  =  elnx  —  ey,  that  is,  x  —  ey .  We  say  that  this  equation  defines  the  function  y  =  lnx 
implicitly  because  while  it  is  not  an  explicit  expression  y  =  . . it  is  true  that  if  x  =  ey  then  y  is  in  fact  the 
natural  logarithm  function.  Now,  for  the  time  being,  pretend  that  all  we  know  of  y  is  that  x  —  ey;  what  can 
we  say  about  derivatives?  We  can  take  the  derivative  of  both  sides  of  the  equation: 

d  d  y 

dx  dx 

Then  using  the  chain  rule  on  the  right  hand  side: 


1 


ey  =  y'ey. 


Then  we  can  solve  for  V: 

y=4  =  f 

ey  x 

There  is  one  little  difficulty  here.  To  use  the  chain  rule  to  compute  d/dx(ey)  —  y'ey  we  need  to  know  that 
the  function  y  has  a  derivative.  All  we  have  shown  is  that  if  it  has  a  derivative  then  that  derivative  must 
be  1/x.  When  using  this  method  we  will  always  have  to  assume  that  the  desired  derivative  exists,  but 
fortunately  this  is  a  safe  assumption  for  most  such  problems. 

The  example  y  =  lnx  involved  an  inverse  function  defined  implicitly,  but  other  functions  can  be  defined 
implicitly,  and  sometimes  a  single  equation  can  be  used  to  implicitly  define  more  than  one  function. 
Here’s  a  familiar  example. 
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Example  4.38:  Derivative  of  Circle  Equation 


The  equation  r 2  =  x2  +  y2  describes  a  circle  of  radius  r.  The  circle  is  not  a  function  y  —  f(x)  because 
for  some  values  ofx  there  are  two  corresponding  values  ofy.  If  we  want  to  work  with  a  function,  we 
can  break  the  circle  into  two  pieces,  the  upper  and  lower  semicircles,  each  of  which  is  a  function. 
Let’s  call  these  y  —  U(x )  and  y  —  L(x) ;  in  fact  this  is  a  fairly  simple  example,  and  it’s  possible  to  give 
explicit  expressions  for  these:  U  (x)  —  \f  r2  —  x2  and  L(x  )  =  -^/r2  —  x2  .  But  it’s  somewhat  easier, 
and  quite  useful,  to  view  both  functions  as  given  implicitly  by  r 2  =  x2  +y2:  both  r2  —  x2  +  U (x)2 
and  r2  —  x2  +L(x)2  are  true,  and  we  can  think  of  r 2  =  x2  +y2  as  defining  both  U (x)  and  L(x) . 

Now  we  can  take  the  derivative  of  both  sides  as  before,  remembering  thaty  is  not  simply  a  variable 
but  a  function — in  this  case,  y  is  either  U (x)  or  L(x)  but  we’re  not  yet  specifying  which  one.  When 
we  take  the  derivative  we  just  have  to  remember  to  apply  the  chain  rule  where  y  appears. 


d  9 
dxr 
0 
/ 

y 


2x  +  2yy 
—2x  x 


2  y 


y 


Now  we  have  an  expression  for  y' ,  but  it  contains  y  as  well  as  x.  This  means  that  if  we  want  to 
compute  /  for  some  particular  value  ofx  we’ll  have  to  know  or  compute  y  at  that  value  ofx  as  well. 
It  is  at  this  point  that  we  will  need  to  know  whether  y  is  U  (x)  or  L(x).  Occasionally  it  will  turn  out 
that  we  can  avoid  explicit  use  ofU(x)  orL(x)  by  the  nature  of  the  problem. 


Solution.  Since  we  know  both  the  x  and  y  coordinates  of  the  point  of  interest,  we  do  not  need  to  explicitly 
recognize  that  this  point  is  on  L(x),  and  we  do  not  need  to  use  L(x)  to  compute  y  -  but  we  could.  Using 
the  calculation  of  /  from  above, 

/  x  11 

37  _  y  ~  -y/3  ~  y/3' 

It  is  instructive  to  compare  this  approach  to  others. 

We  might  have  recognized  at  the  start  that  ( 1 ,  —  \/3)  is  on  the  function  y  —  L(x)  —  —\j 4  —  x2.  We 
could  then  take  the  derivative  of  L(x),  using  the  power  rule  and  the  chain  rule,  to  get 

L'  M  =  -i(4-V)-‘/2(-2x)  =  -^=. 

Then  we  could  compute  Z/(  1)  =  1/  \/3  by  substituting  x  =  1. 

Alternately,  we  could  realize  that  the  point  is  on  L[x),  but  use  the  fact  that  y'  —  —x/y.  Since  the  point 
is  on  L(x)  we  can  replace  y  by  L(x)  to  get 
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without  computing  the  derivative  of  L(x)  explicitly.  Then  we  substitute  x  —  1  and  get  the  same  answer  as 
before.  X 

In  the  case  of  the  circle  it  is  possible  to  find  the  functions  U  (jc)  and  L(x)  explicitly,  but  there  are  poten¬ 
tial  advantages  to  using  implicit  differentiation  anyway.  In  some  cases  it  is  more  difficult  or  impossible  to 
find  an  explicit  formula  for  y  and  implicit  differentiation  is  the  only  way  to  find  the  derivative. 


Example  4.40:  Derivative  of  Function  defined  Implicitly 


Find  the  derivative  of  any  function  defined  implicitly  by  yx"  +  y  —  x. 


Solution.  We  treat  y  as  an  unspecified  function  and  use  the  chain  rule: 


,V+r) 

d 

dx 

(y  •  2x  +  y'  ■  x2)  +  2 yy' 

=  1 

y  ■  x2  +  2 yy' 

£ 

A 

i 

II 

/ 

—2  xy 

y 

x2  +  2y 

Example  4.41:  Derivative  of  Function  defined  Implicitly 


Find  the  derivative  of  any  function  defined  implicitly  by  yx2  +  ey  —  x. 


Solution.  We  treat  y  as  an  unspecified  function  and  use  the  chain  rule: 


(y-2x  +  V  •  x2)  +  y'ey 
yx2  +  yey 
y,{x2  +  ey) 
/ 

y 


d 

dx 

1 

1  —  2xy 
1  —  2xy 
1  —  2xy 
x2  +  ey 


X 


You  might  think  that  the  step  in  which  we  solve  for  /  could  sometimes  be  difficult — after  all,  we’re 
using  implicit  differentiation  here  because  we  can’t  solve  the  equation  yx2  +  ey  —  x  for  y,  so  maybe  after 
taking  the  derivative  we  get  something  that  is  hard  to  solve  for  /.  In  fact,  this  never  happens.  All 
occurrences  /  come  from  applying  the  chain  rule,  and  whenever  the  chain  rule  is  used  it  deposits  a  single 
y'  multiplied  by  some  other  expression.  So  it  will  always  be  possible  to  group  the  terms  containing  / 
together  and  factor  out  the  /,  just  as  in  the  previous  example.  If  you  ever  get  anything  more  difficult  you 
have  made  a  mistake  and  should  fix  it  before  trying  to  continue. 

It  is  sometimes  the  case  that  a  situation  leads  naturally  to  an  equation  that  defines  a  function  implicitly. 
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Example  4.42:  Equation  and  Derivative  of  Ellipse 


Discuss  the  equation  and  derivative  of  the  ellispe. 


Solution.  Consider  all  the  points  (x,y)  that  have  the  property  that  the  distance  from  (x,y)  to  (xi,yi)  plus 
the  distance  from  (x,y)  to  (*2, >’2)  is  2a  (a  is  some  constant).  These  points  form  an  ellipse,  which  like  a 
circle  is  not  a  function  but  can  be  viewed  as  two  functions  pasted  together.  Since  we  know  how  to  write 
down  the  distance  between  two  points,  we  can  write  down  an  implicit  equation  for  the  ellipse: 

^/(.v-xi)2  +  (y-yi)2  +  yj  (x-x2)2  +  (y-y2)2  =  2  a. 

Then  we  can  use  implicit  differentiation  to  find  the  slope  of  the  ellipse  at  any  point,  though  the  computation 
is  rather  messy.  X 


Solution.  Differentiating  both  sides  with  respect  to  x  gives: 


6x2  + 


o  ,  2 dy\  8 dy 

2.xy  +  x  —  -9  y  —  =  3, 
dx  j  dx 


-9/ 
■  9y8) 


dy 

dx 

dy 

dx 


3  —  6x2  —  2 xy 
3  —  6x2  —  2 xy 


dy  3  —  6x2  —  2xy 
dx  x2  —  9y8 


X 


In  the  previous  examples  we  had  functions  involving  x  and  y,  and  we  thought  of  y  as  a  function  of 
x.  In  these  problems  we  differentiated  with  respect  to  x.  So  when  faced  with  x’s  in  the  function  we 
differentiated  as  usual,  but  when  faced  withy’s  we  differentiated  as  usual  except  we  multiplied  by  a  ^  for 
that  term  because  we  were  using  Chain  Rule. 

In  the  following  example  we  will  assume  that  both  x  and  y  are  functions  of  t  and  want  to  differentiate 
the  equation  with  respect  to  t.  This  means  that  every  time  we  differentiate  an  x  we  will  be  using  the  Chain 
Rule,  so  we  must  multiply  by  4^,  and  whenever  we  differentiate  a  y  we  multiply  by 
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Example  4.44:  Derivative  of  Function  of  an  Additional  Variable 


Thinking  ofx  and  y  as  functions  of  t,  differentiate  the  following  equation  with  respect  to  t: 

x2+y2  =  100. 


Solution.  Using  the  Chain  Rule  we  have: 


„  dx  „  dy 
2x—  +  2y-~  =  0. 
dt  dt 


X 


Example  4.45:  Derivative  of  Function  of  an  Additional  Variable 


dx 


dy 


Ify  —  xr  +  5x  and  —  —  1,  find  —  when  x  =  1 . 


dt 


dt 


Solution.  Differentiating  each  side  of  the  equation  y  —  x3  +  5x  with  respect  to  t  gives: 

dy  9  dx  dx 
~r  =  3x  —  +  5— . 
dt  dt  dt 

When  x  =  1  and  ^  =  7  we  have: 


J-  =  3(12)(7)  +  5(7)  =  21  +35  =  56. 


X 


Logarithmic  Differentiation 


Previously  we’ve  seen  how  to  do  the  derivative  of  a  number  to  a  function  and  also  a  function  to  a 

number  \(f(x))n]'.  But  what  about  the  derivative  of  a  function  to  a  function  [(  f(x))8^]>r! 

In  this  case,  we  use  a  procedure  known  as  logarithmic  differentiation. 


Steps  for  Logarithmic  Differentiation 


•  Take  In  of  both  sides  of  y  =  f(x)  to  get  ln  y  =  \n  f{x)  and  simplify  using  logarithm  properties, 

dy 

•  Differentiate  implicitly  with  respect  to  x  and  solve  for  — , 

dx 

•  Replace  y  with  its  function  of  x  (i.e.,  f(x)). 
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Solution.  We  take  in  of  both  sides: 
Using  log  properties  we  have: 
Differentiating  implicitly  gives: 


Solving  for  /  gives: 


lnv  =  lnxL 
lny  =  x\nx. 

V  i 

=  (1)  lnx  +  x- 

y  X 

y' 

—  —  lnx  +  1. 

y 

y  =y(l  +lnx). 


Replace  y  —  xx  gives: 

y  =x*(l  +lnx). 

Another  method  to  find  this  derivative  is  as  follows: 

d 


= 

dx 


jdnx 


dx 


-f-xlnx  1  e 
dx  ) 

—  (x-  +  lnx)x* 

—  ( 1  +  lnx)x* 


jdnjt 


X 

In  fact,  logarithmic  differentiation  can  be  used  on  more  complicated  products  and  quotients  (not  just 
when  dealing  with  functions  to  the  power  of  functions). 


Solution.  Using  product  &  quotient  rules  for  this  problem  is  a  complete  nightmare!  Let’s  apply  logarithmic 
differentiation  instead.  Take  In  of  both  sides: 


/(x  +  2)3(2x+1)9\ 

V  x8(3x+ 1)4  )  ' 


lny  =  In 
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Applying  log  properties: 


lny  =  In  ((x  +  2)3(2x  +  l)9)  —  In  (x8(3x  +  l)4) . 

lny  =  In  ((x  +  2)3)  +  ln((2x+l)9)  -  [ln(x8)  +  ln  ((3x+ l)4)]  . 
lny  =  31n(x  +  2)  +  91n(2x  +  1)  —  81nx  — 41n(3x  +  1). 

Now,  differentiating  implicitly  with  respect  to  x  gives: 


Solving  fory'  gives: 


y_JS_  18  8  12 

y  x  +  2  2x  + 1  x  3x  + 1 

12 


y  =y 


3  18  8 

+ 


x  +  2  2x+l  x  3x+l 


r>  i  (x+2)3(2.t+l)9 

Replace  y  =  x8(y+1)+  gives: 


/  _  (x  +  2)3(2x+1)9  /  3  18  _  8  _  12  \ 

x8(3x+l)4  \x  +  2^~2x+l  x  3x+ 1  / 


X 


Exercises  for  Section  4.7 


Exercise  4.7.1  Find  a  formula  for  the  derivative  V  at  the  point  (x,y): 


(a)  y  =  1  +x 

(b)  x2  +xy  +  y2  =  7 


(c)  x3  +xy2  =  y3  +yx2 


(d)  4cosxsiny  =  1 


(e)  ^+^  =  9 

(f)  tan(x/y)=x  +  y 

(g)  sin(x  +  y)  =  xy 

1  1 

(h)  -  +  -  =  7 
x  y 


Exercise  4.7.2  A  hyperbola  passing  through  (8,6)  consists  of  all  points  whose  distance  from  the  origin  is 
a  constant  more  than  its  distance  from  the  point  (5,2).  Find  the  slope  of  the  tangent  line  to  the  hyperbola 
at  (8,6). 

Exercise  4.7.3  The  graph  of  the  equation  x  — xy+y  —9  is  an  ellipse.  Find  the  lines  tangent  to  this  curve 
at  the  two  points  where  it  intersects  the  x-axis.  Show  that  these  lines  are  parallel. 

Exercise  4.7.4  Repeat  the  previous  problem  for  the  points  at  which  the  ellipse  intersects  the  y-axis. 
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Exercise  4.7.5  If  y  —  logflx  then  ay  —  x.  Use  implicit  differentiation  to  find  y  . 

Exercise  4.7.6  Find  the  points  on  the  ellipse  from  the  previous  two  problems  where  the  slope  is  horizontal 
and  where  it  is  vertical. 

Exercise  4.7.7  Find  an  equation  for  the  tangent  line  to  x4  =  v2  +  x2  at  (2,  \/l2).  (This  curve  is  the 

kampyle  of  Eudoxus.) 

Exercise  4.7.8  Find  an  equation  for  the  tangent  line  to  x2/3  +  y2//3  =  cr/3  at  a  point  (x  | ,  y  i )  on  the  curve, 
with  x\  f  0  andy\  f  0.  (This  curve  is  an  as! mid. ) 

A  A  A  A  A 

Exercise  4.7.9  Find  an  equation  for  the  tangent  line  to  (x~  +y  )  —x~—  y  at  a  point  (xi  ,y  i )  on  the  curve, 
with  x\  /  0,  —  1 , 1 .  (This  curve  is  a  lemniscate.) 

Exercise  4.7.10  Two  curves  are  orthogonal  if  at  each  point  of  intersection,  the  angle  between  their 
tangent  lines  is  n /2.  Two  families  of  curves,  sf  and  £$,  are  orthogonal  trajectories  of  each  other  if  given 
any  curve  C  in  ■<//  and  any  curve  D  in  FS  the  curves  C  and  D  are  orthogonal.  For  example,  the  family  of 
horizontal  lines  in  the  plane  is  orthogonal  to  the  family  of  vertical  lines  in  the  plane. 

(a)  Show  that  x  —y  =  5  is  orthogonal  to  4x~  +  9y  =72.  (Flint:  You  need  to  find  the  intersection  points 
of  the  two  curx’es  and  then  show  that  the  product  of  the  derivatives  at  each  intersection  point  is  —  1.) 

9  9  9 

(b)  Show  that  x  +y  —r  is  orthogonal  to  y  —  mx.  Conclude  that  the  family  of  circles  centered  at  the 
origin  is  an  orthogonal  trajectory  of  the  family  of  lines  that  pass  through  the  origin. 

Note  that  there  is  a  technical  issue  when  m  —  0.  The  circles  fail  to  be  differentiable  when  they  cross 
the  x-axis.  However,  the  circles  are  orthogonal  to  the  x-axis.  Explain  why.  Likewise,  the  vertical 
line  through  the  origin  requires  a  separate  argument. 

(c)  For  k  f  0  and  c  f  0  show  that  y~  —  x~  —  k  is  orthogonal  to  yx  —  c.  In  the  case  where  k  and  c  are 

9  9 

both  zero,  the  curves  intersect  at  the  origin.  Are  the  curves  y  —  x~  =  0  and  yx  =  0  orthogonal  to 
each  other? 

(d)  Suppose  that  m  f  0.  Show  that  the  family  ofcurx’es  {y  =  mx  +  b  \  b  G  M}  is  orthogonal  to  the  family 
ofcun’es  {y  =  —(x/m)  +c  |  c  G  M}. 


Exercise  4.7.11 

Exercise  4.7.12 

Exercise  4.7.13 


rv/y.  •  If-  (X-  1)8(>-23)1/2 
Differentiate  the  junction  y  —  — ^  ^ — 

Differentiate  the  function  f(x)  =  (x  +  1  )SI,1A. 

eY(cosx  +  2)3 


Differentiate  the  function  g(x)  = 


Vx2  +  4 
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4.8  Derivatives  of  Inverse  Functions 


Suppose  we  wanted  to  find  the  derivative  of  the  inverse ,  but  do  not  have  an  actual  formula  for  the  inverse 
function?  Then  we  can  use  the  following  derivative  formula  for  the  inverse  evaluated  at  a. 


Derivative  of  / *  1  (a) 


Given  an  invertible  function  /(x) ,  the  derivative  of  its  inverse  function  /  1  (x)  evaluated  at  x  —  a  is: 


^  ^  {a)  f'lf-'W] 


To  see  why  this  is  true,  start  with  the  function  y  —  f  1  (x) .  Write  this  as  x  —  f(y)  and  differentiate  both 
sides  implicitly  with  respect  to  x  using  the  chain  rule: 


Thus, 


buty  =  /  1  (x),  thus, 


At  the  point  x  —  a  this  becomes: 


1  =/'(>’)• 


dy 

dx 


dy  1 
dx  fiy)  ’ 

[/  w  =  /'[/-1  wj 

[/  (0)  =  /'[/-‘(a)] 


Solution.  First  we  should  show  that  /  1  exists  (i.e.  that  /  is  one-to-one).  In  this  case  the  derivative 
f{x)  —  5x4  +  6v2  +  7  is  strictly  greater  than  0  for  all  x,  so  /  is  strictly  increasing  and  thus  one-to-one. 

It’s  difficult  to  find  the  inverse  of  f(x)  (and  then  take  the  derivative).  Thus,  we  use  the  above  formula 
evaluated  at  1 : 

[/ ']  (1)  =  /[/-i(i)]' 

Note  that  to  use  this  formula  we  need  to  know  what  1  ( 1 )  is,  and  the  derivative  f{x).  To  find  /_1(  1) 
we  make  a  table  of  values  (plugging  in  x  =  —3,  —2,  —  1,0, 1,2,3  into  /(x))  and  see  what  value  of  x  gives 
1.  We  omit  the  table  and  simply  observe  that  /( 0)  =  1.  Thus, 

r\  i)=o. 
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Now  we  have: 

[r'],(1)  =  7W 

And  so,  /'( 0)  =  7.  Therefore, 

[r‘]'(i)  =  f 

* 


4.8.1.  Derivatives  of  Inverse  Trigonometric  Functions 


We  can  apply  the  technique  used  to  find  the  derivative  of  /  1  above  to  find  the  derivatives  of  the  inverse 
trigonometric  functions. 

In  the  following  examples  we  will  derive  the  formulae  for  the  derivative  of  the  inverse  sine,  inverse 
cosine  and  inverse  tangent.  The  other  three  inverse  trigonometric  functions  have  been  left  as  exercises  at 
the  end  of  this  section. 


Solution.  As  above,  we  write  y  =  sin  1  (jc),  sox=  sin(y)  and  —%/ 2  <J<  tt/2,  and  differentiate  both 
sides  with  respect  to  x  using  the  chain  rule. 


d 

dx 

1 

1 

dy 

dx 


d  .  ,  . 
-smW 

/  \  dy 
c°sW- 


cos 


(Sm->W)g 

1 


cos  (sin  1  (jc)) 


Although  correct,  this  formula  is  cumbersome  to  use,  and  can  be  simplified  significantly  with  a  bit  of 
trigonometry.  Let  9  —  sin-1  (A),  so  sin(0)  =  x,  and  construct  a  right  angle  triangle  with  angle  0,  opposite 
side  length  x  and  hypotenuse  1.  The  Pythagorean  Theorem  gives  an  adjacent  side  length  of  \f  1  —x2, 
so  cos  (sin  1  (jc))  =  cos(0)  =  \J  1—x2.  Note  that  we  choose  the  non-negative  square  root  x/ 1  —  x2  since 
cos(0)  >  0  when  —n/ 2  <9  <  tz/2. 

Finally,  the  derivative  of  inverse  sine  is 


(sin  '(x))'  = 


1 


\J  1  —  x2 


X 
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Solution.  Let  y  —  cos  1  (x),  so  cos(y)  =  x  and  0  <  y  <  K.  Next  we  differentiate  implicitly: 


d 

dx 


(cosy) 


—  siny  • 


dy 

dx 

dy 

dx 


d 

dx 


(*) 


i 


l 

siny 


This  time,  just  for  variety,  we  will  leave  the  derivative  in  terms  of  y  and  apply  some  trigonometry. 
Since  cosy  =  x,  we  construct  a  triangle  with  angle  y,  adjacent  side  length  x  and  hypotenuse  1.  Solving  for 
the  opposite  side  length  using  Pythagorean  Theorem  we  obtain  x/ 1  —  x2.  Using  this  triangle  we  can  see 
that  siny  =  \J  1  —  x2  (0  <J<  k).  Substituting  this  into  the  equation  for  dy/dx,  we  find  that 


d 

dx 


M 


L(coS->W) 


X 


In  the  following  example  we  explore  an  alternate  method  of  finding  the  derivative. 


Solution.  We  begin  with  tan  (tan  1  (x))  =  x.  Taking  the  derivative  using  the  Chain  Rule  we  obtain 

sec2  (tan_1(x))  •  (tan~!(x))  =  1, 

Cl  si 

which  we  rearrange  to  obtain 

-j-  (tan_1(x))  =  — 1  ... 

dx  v  y  sec2  (tan  1  (x)) 

Let  tan  1  (x)  =  0,  then  tan(0)  =  x.  We  construct  a  triangle  with  angle  0,  adjacent  side  1  and  oppo¬ 
site  side  x.  The  hypotenuse  is  vT+x2  using  Pythagorean  theorem.  Then  sec2  (tan  -■w)  =  sec2(0)  = 

(sec(0))2  =  (x/l+x2)  =  1  +x2.  Recall  that  sec(x)  —  \j  cos(x).  Finally,  the  derivative  is 


d 

dx 


(tan  !(x)) 


1 

1  Tx2 


X 
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Exercises  for  4.8 


Exercise  4.8.1  Given  f{x)  —  1  +  ln(x  —  2),  first  show  that  f  1  exists,  then  compute  [f  '] r  ( 1 ). 

Exercise  4.8.2  The  inverse  cotangent  function,  denoted  by  cot  1  (x),  is  defined  to  be  the  inverse  of  the 
restricted  cotangent  function:  cot(x),  0  <x  <  K.  Find  the  derivative  of  cot  '(x). 

Exercise  4.8.3  The  inverse  secant  function,  denoted  by  sec-1  (A),  is  defined  to  be  the  inverse  of  the 
restricted  secant  function:  sec(x),  x  G  [0,  7t/2)  U  [7T,37t/2).  Find  the  derivative  of  see  '(x). 

Exercise  4.8.4  The  inverse  cosecant  function,  denoted  by  esc- 1  (x),  is  defined  to  be  the  inverse  of  the 
restricted  cosecant  function:  csc(x),  x  G  (0,  7t/2]  U  (  tt,  3tt/2]  .  Find  the  derivative  of  csc_1(x). 

Exercise  4.8.5  Suppose  /(x)  =  x3  +4x  +  2.  Find  the  slope  of  the  tangent  line  to  the  graph  of  g(x)  — 
xf  1  (x)  at  the  point  where  x  =  7. 

Exercise  4.8.6  Find  the  derivatives  of  sin-1  (x)  +  cos-1  (x)  and  (x2  +  1 )  tan  1  (x). 

Exercise  4.8.7  Differentiate  y  =  sin  '(x2)  and  y  —  tan  1  (3x). 

4.9  Additional  Exercises 


Exercise  4.9.1  Find  the  derivatives  of  the  following  functions  from  definition. 

(a)  f(x)  =  (2x  +  3)2 

(b)  g(x)  =x 3/2 


Exercise  4.9.2  Let  /(x) 


x 

5x  —  x2 


ifx  <  1 
ifx  >  1 


.  Use  the  definition  of  the  derivative  to  find  f'{  1 ). 


Exercise  4.9.3  Differentiate  the  following  functions. 


( a)  y  —  7x4  —  7  7T4  + 


nf/x 


(b)  fix) 


1  ~  y/x 
1  +  y/x 


(c)  /(x)  =  |x— l|  +  |x  +  2 

(d)  f(x)—x2 sinxcosx 
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(e)  y 


xsinx 
1  +  sinx 


(f)  g{x) 


3 


<g)y  =  ^+xt+i  +  {x3_'x+4)5 

(h)  y  =  sin3x  —  sin(x3) 

(i)  F(x)  —  sec4x  +  tan4x 

Wy= cos2(rs) 

(k)  y  —  tan(sin(x2  +  sec2x)) 


(l)  y  = 


1 

2  +  sin  | 


Exercise  4.9.4  Differentiate  the  following  functions. 

(a)  y  —  e3x  +  e~x  +  e2 

(b)  y  —  elx  cos3x 

(c)  f(x)=  tan(x  +  eY) 

(d)  g(x)  = 

e*  +  2 

fej  y  —  ln(2  +  sinx)  —  sin(2  +  lnx) 

(f)  f(x)  =  ex K  +  xn<!  +  ne' 

(g)  y  =  logfl(M)  +  blogaX,  where  a  and  b  are  positive  real  numbers  and  a  f  1 

(h)  y  =  (x2  +  i  y3+l 

(i)  y=(x2  +  ex)l/lnx 

=  X\J X2  +  X  +  1 

J  y  (2  +  sinx)4(3x  +  5)7 


Exercise  4.9.5  Find 


dy 

dx 


ify  is  a  differentiable  function  that  satisfy  the  given  equation. 


(a)  x2+xy  +  y2  =  7 

(b)  x2+yl  =  {2x2  +  2y2-x)2 
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(c)  x-siny  +  y3  =  cosx 

(d)  x2+xey  =  2y  +  ex 


Exercise  4.9.6  Differentiate  the  following  functions. 


(a)  y  —  xsin  1  x 


(b)  f(x)  = 


sin  x 

COS”1* 


(c)  g(x)  —  tan  1  where 


a  >  0 


(d)y  —  xtan  lx—  -ln(x2  +  l) 


5.  Applications  of  Derivatives 


In  this  chapter  we  explore  how  to  use  derivative  and  differentiation  to  solve  a  variety  of  problems,  some 
mathematical  and  some  practical.  We  explore  some  applications  which  motivated  and  were  formalized 
in  the  definition  of  the  derivative,  and  look  at  a  few  clever  uses  of  the  tangent  line  (which  has  immediate 
geometric  ties  to  the  definition  of  the  derivative). 


5.1  Related  Rates 


When  defining  the  derivative  f  (x),  we  define  it  to  be  exactly  the  rate  of  change  of  f(x)  with  respect  to 
x.  Consequently,  any  question  about  rates  of  change  can  be  rephrased  as  a  question  about  derivatives. 
When  we  calculate  derivatives,  we  are  calculating  rates  of  change.  Results  and  answers  we  obtain  for 
derivatives  translate  directly  into  results  and  answers  about  rates  of  change.  Let  us  look  at  some  examples 
where  more  than  one  variable  is  involved,  and  where  our  job  is  to  analyze  and  exploit  relations  between 
the  rates  of  change  of  these  variables.  The  mathematical  step  of  relating  the  rates  of  change  turns  out  to 
be  largely  an  exercise  in  differentiation  using  the  chain  rule  or  implicit  differentiation.  This  explains  why 
some  textbooks  place  this  section  shortly  after  the  sections  on  the  chain  rule  and  implicit  differentiation. 

Suppose  we  have  two  variables  x  and  y  (in  most  problems  the  letters  will  be  different,  but  for  now 
let’s  use  x  and  y)  which  are  both  changing  with  time.  A  “related  rates”  problem  is  a  problem  in  which 
we  know  one  of  the  rates  of  change  at  a  given  instant — say,  x  —  dx/dt — and  we  want  to  find  the  other 
rate  y  —  dy/dt  at  that  instant.  (The  use  of  x  to  mean  dx/dt  goes  back  to  Newton  and  is  still  used  for  this 
purpose,  especially  by  physicists.) 

If  y  is  written  in  terms  of  x,  i.e.,  y  —  f(x),  then  this  is  easy  to  do  using  the  chain  rule: 

dy  dy  dx  dy . 
dt  dx  dt  dx 

That  is,  find  the  derivative  of  f(x),  plug  in  the  value  of  x  at  the  instant  in  question,  and  multiply  by  the 
given  value  of  x  —  dx/dt  to  get  y  —  dy/dt. 


Example  5.1:  Speed  at  which  a  Coordinate  is  Changing 


Suppose  an  object  is  moving  along  a  path  described  by  y  —  x2,  that  is,  it  is  moving  on  a  parabolic 
path.  At  a  particular  time,  say  t  —  5,  the  x  coordinate  is  6  and  we  measure  the  speed  at  which  the  x 
coordinate  of  the  object  is  changing  and  find  that  dx/ dt  =  3. 

At  the  same  time,  how  fast  is  the  y  coordinate  changing? 


Solution.  Using  the  chain  rule,  dy/dt  —  2x  ■  dx/dt.  At  t  —  5  we  know  that  x  —  6  and  dx/dt  —  3,  so 
dy/dt  —  2  •  6  ■  3  =  36.  4k 
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In  many  cases,  particularly  interesting  ones,  x  and  y  will  be  related  in  some  other  way,  for  example 
x  —  f(y),  or  F(x,y)  —  k,  or  perhaps  F(x,y)  =  G(x,y),  where  F(x,y )  and  G(x,y)  are  expressions  involving 
both  variables.  In  all  cases,  you  can  solve  the  related  rates  problem  by  taking  the  derivative  of  both  sides, 
plugging  in  all  the  known  values  (namely,  x,  y,  and  i),  and  then  solving  for  y. 

To  summarize,  here  are  the  steps  in  doing  a  related  rates  problem. 


Example  5.2:  Receding  Airplanes 


A  plane  is  Hying  directly  away  from  you  at  500  mph  at  an  altitude  of  3  miles.  How  fast  is  the  plane ’s 
distance  from  you  increasing  at  the  moment  when  the  plane  is  Hying  over  a  point  on  the  ground  4 
miles  from  you? 


Solution.  To  see  what’s  going  on,  we  first  draw  a  schematic  representation  of  the  situation,  as  in  Figure  5.1. 

Because  the  plane  is  in  level  flight  directly  away  from  you,  the  rate  at  which  x  changes  is  the  speed  of 
the  plane,  dx/dt  —  500.  The  distance  between  you  and  the  plane  is  y;  it  is  dy/dt  that  we  wish  to  know.  By 

9  9 

the  Pythagorean  Theorem  we  know  that  x  +  9  =  y".  Taking  the  derivative: 


2xx  =  2 yy. 


9  9 

We  are  interested  in  the  time  at  which  x  —  4;  at  this  time  we  know  that  4”  +  9  =  y  ,  so  y  =  5.  Putting 
together  all  the  information  we  get 

2(4)  (500)  =  2(5)y. 

Thus,  y  —  400  mph.  X 


3 


x 


Figure  5.1:  Receding  airplane. 
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Example  5.3:  Spherical  Balloon 


You  are  inflating  a  spherical  balloon  at  the  rate  of  7  cm3 /sec.  How  fast  is  its  radius  increasing  when 
the  radius  is  4  cm? 


Solution.  Here  the  variables  are  the  radius  r  and  the  volume  V.  We  know  dV /dt,  and  we  want  dr/dt.  The 
two  variables  are  related  by  the  equation  V  =  4nr3 /3.  Taking  the  derivative  of  both  sides  gives  dV /dt  = 
47 ir~f.  We  now  substitute  the  values  we  know  at  the  instant  in  question:  7  =  4n42r,  so  r  =  7/(64tt) 
cm/sec.  & 


Example  5.4:  Conical  Container 


Water  is  poured  into  a  conical  container  at  the  rate  of  10  cm3 /sec.  The  cone  points  directly  down, 
and  it  has  a  height  of  30  cm  and  a  base  radius  of  10  cm;  see  Figure  5.2.  How  fast  is  the  water  level 
rising  when  the  water  is  4  cm  deep  (at  its  deepest  point)? 


Solution.  The  water  forms  a  conical  shape  within  the  big  cone;  its  height  and  base  radius  and  volume  are 
all  increasing  as  water  is  poured  into  the  container.  This  means  that  we  actually  have  three  things  varying 
with  time:  the  water  level  h  (the  height  of  the  cone  of  water),  the  radius  r  of  the  circular  top  surface  of 
water  (the  base  radius  of  the  cone  of  water),  and  the  volume  of  water  V.  The  volume  of  a  cone  is  given  by 
V  =  itrh/3.  We  know  dV /dt,  and  we  want  dli/dt.  At  first  something  seems  to  be  wrong:  we  have  a  third 
variable,  r,  whose  rate  we  don’t  know. 

However,  the  dimensions  of  the  cone  of  water  must  have  the  same  proportions  as  those  of  the  container. 
That  is,  because  of  similar  triangles,  r/h  —  10/30  so  r  =  h/3.  Now  we  can  eliminate  r  from  the  problem 
entirely:  V  —  Tt(h/3)2h/3  =  Kh3 /21 .  We  take  the  derivative  of  both  sides  and  plug  in  h  =  4  and  dV  /dt  — 
10,  obtaining  10  =  {3%  ■  42 /21)(dh/dt).  Thus,  dh/dt  —  90/ (  16tt)  cm/sec. 


<-  10  ->• 


Figure  5.2:  Conical  water  tank. 
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A  swing  consists  of  a  board  at  the  end  of  a  10  ft  long  rope.  Think  of  the  board  as  a  point  P  at  the  end 
of  the  rope,  and  let  Q  be  the  point  of  attachment  at  the  other  end.  Suppose  that  the  swing  is  directly 
below  Q  at  time  t  =  0,  and  is  being  pushed  by  someone  who  walks  at  6  ft/sec  from  left  to  right. 
Find  (a)  how  fast  the  swing  is  rising  after  1  sec;  (b)  the  angular  speed  of  the  rope  in  deg/sec  after  1 
sec. 


Solution.  We  start  out  by  asking:  What  is  the  geometric  quantity  whose  rate  of  change  we  know,  and  what 
is  the  geometric  quantity  whose  rate  of  change  we’re  being  asked  about?  Note  that  the  person  pushing 
the  swing  is  moving  horizontally  at  a  rate  we  know.  In  other  words,  the  horizontal  coordinate  of  P  is 
increasing  at  6  ft/sec.  In  the  xy-plane  let  us  make  the  convenient  choice  of  putting  the  origin  at  the  location 
of  P  at  time  t  =  0,  i.e.,  a  distance  10  directly  below  the  point  of  attachment.  Then  the  rate  we  know  is 
dx/dt,  and  in  part  (a)  the  rate  we  want  is  dy/dt  (the  rate  at  which  P  is  rising).  In  part  (b)  the  rate  we 
want  is  0  —  dQ/dt ,  where  0  stands  for  the  angle  in  radians  through  which  the  swing  has  swung  from  the 
vertical.  (Actually,  since  we  want  our  answer  in  deg/sec,  at  the  end  we  must  convert  dO /dt  from  rad/sec 
by  multiplying  by  180/ tt.) 

(a)  From  the  diagram  we  see  that  we  have  a  right  triangle  whose  legs  are  x  and  10  —  y,  and  whose  hy¬ 
potenuse  is  10.  Hence  x2  +  ( 10  —  y)2  —  100.  Taking  the  derivative  of  both  sides  we  obtain:  2xx  +  2(10  — 
y)(0  —  y)  =  0.  We  now  look  at  what  we  know  after  1  second,  namely  x  =  6  (because  x  started  at  0  and 
has  been  increasing  at  the  rate  of  6  ft/sec  for  1  sec),  thus  y  —  2  (because  we  get  10  —  y  —  8  from  the 
Pythagorean  theorem  applied  to  the  triangle  with  hypotenuse  10  and  leg  6),  and  x  =  6.  Putting  in  these 
values  gives  us  2  •  6  •  6  —  2  •  8y  =  0,  from  which  we  can  easily  solve  for  y:  y  —  4.5  ft/sec. 

(b)  Here  our  two  variables  are  x  and  0,  so  we  want  to  use  the  same  right  triangle  as  in  part  (a),  but  this 

time  relate  0  to  x.  Since  the  hypotenuse  is  constant  (equal  to  10),  the  best  way  to  do  this  is  to  use  the 
sine:  sin0  =  x/10.  Taking  derivatives  we  obtain  (cos  0)0  =  O.lx.  At  the  instant  in  question  (1=1  sec), 
when  we  have  a  right  triangle  with  sides  6-8-10,  cos0  =  8/10  and  x  —  6.  Thus  (8/10)0  =  6/10,  i.e., 
0  =  6/  8  =  3/4  rad/sec,  or  approximately  43  deg/sec.  & 


Q 


p 


Figure  5.3:  Swing. 


We  have  seen  that  sometimes  there  are  apparently  more  than  two  variables  that  change  with  time,  but 
in  reality  there  are  just  two,  as  the  others  can  be  expressed  in  terms  of  just  two.  However  sometimes  there 
really  are  several  variables  that  change  with  time;  as  long  as  you  know  the  rates  of  change  of  all  but  one  of 
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them  you  can  find  the  rate  of  change  of  the  remaining  one.  As  in  the  case  when  there  are  just  two  variables, 
take  the  derivative  of  both  sides  of  the  equation  relating  all  of  the  variables,  and  then  substitute  all  of  the 
known  values  and  solve  for  the  unknown  rate. 


Example  5.6:  Distance  Changing  Rate 


A  road  running  north  to  south  crosses  a  road  going  east  to  west  at  the  point  P.  Car  A  is  driving 
north  along  the  first  road,  and  car  B  is  driving  east  along  the  second  road.  At  a  particular  time  car  A 
is  10  kilometers  to  the  north  of  P  and  traveling  at  80  km/hr,  while  car  B  is  15  kilometers  to  the  east 
of  P  and  traveling  at  100  km/hr.  How  fast  is  the  distance  between  the  two  cars  changing? 


Solution.  Let  a(t)  be  the  distance  of  car  A  north  of  P  at  time  t,  and  b(t)  the  distance  of  car  B  east  of 
P  at  time  t,  and  let  c(l)  be  the  distance  from  car  A  to  car  B  at  time  t.  By  the  Pythagorean  Theorem, 
c(t)2  =  a(t)2  +  b(t)2 .  Taking  derivatives  we  get  2 c(t)c'(t)  =  2a(t)a'(t)  +2 b(t)b'(t),  so 


ad  +  bb 

c  — - 

c 


Substituting  known  values  we  get: 

10-  80+  15  •  100 

C_  \/l02  +  152 

at  the  time  of  interest. 


ad  Abb 
Va2  +  b2 


460 

\/T3 


127.6km/hr 


X 


(0  ,a(t))w 


c(t) 


P 


o) 


Figure  5.4:  Cars  moving  apart. 


Notice  how  this  problem  differs  from  Example  5.2.  In  both  cases  we  started  with  the  Pythagorean 
Theorem  and  took  derivatives  on  both  sides.  However,  in  Example  5.2  one  of  the  sides  was  a  constant  (the 
altitude  of  the  plane),  and  so  the  derivative  of  the  square  of  that  side  of  the  triangle  was  simply  zero.  In  this 
Example,  on  the  other  hand,  all  three  sides  of  the  right  triangle  are  variables,  even  though  we  are  interested 
in  a  specific  value  of  each  side  of  the  triangle  (namely,  when  the  sides  have  lengths  10  and  15).  Make  sure 
that  you  understand  at  the  start  of  the  problem  what  are  the  variables  and  what  are  the  constants. 


Exercises  for  Section  5.1 


Exercise  5.1.1  Air  is  being  pumped  into  a  spherical  balloon  at  a  constant  rate  of  3  cm3/s.  How  fast  is  the 
radius  of  the  balloon  increasing  when  the  radius  reaches  5cm? 
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Exercise  5.1.2  A  cylindrical  tank  standing  upright  (with  one  circular  base  on  the  ground)  has  radius  20 
cm.  How  fast  does  the  water  level  in  the  tank  drop  when  the  water  is  being  drained  at  25  cm3 /sec? 

Exercise  5.1.3  A  cylindrical  tank  standing  upright  ( with  one  circular  base  on  the  ground )  has  radius  1 
meter.  How  fast  does  the  water  level  in  the  tank  drop  when  the  water  is  being  drained  at  3  liters  per 
second? 

Exercise  5.1.4  A  ladder  13  meters  long  rests  on  horizontal  ground  and  leans  against  a  vertical  wall.  The 
foot  of  the  ladder  is  pulled  away  from  the  wall  at  the  rate  of  0.6  m/sec.  How  fast  is  the  top  sliding  down 
the  wall  when  the  foot  of  the  ladder  is  5  mfrom  the  wall? 

Exercise  5.1.5  A  ladder  13  meters  long  rests  on  horizontal  ground  and  leans  against  a  vertical  wall.  The 
top  of  the  ladder  is  being  pulled  up  the  wall  at  0.1  meters  per  second.  How  fast  is  the  foot  of  the  ladder 
approaching  the  wall  when  the  foot  of  the  ladder  is  5  mfrom  the  wall? 

Exercise  5.1.6  A  rotating  beacon  is  located  2  miles  out  in  the  water.  Let  A  be  the  point  on  the  shore  that 
is  closest  to  the  beacon.  As  the  beacon  rotates  at  10  rev/min,  the  beam  of  light  sweeps  down  the  shore 
once  each  time  it  revolves.  Assume  that  the  shore  is  straight.  How  fast  is  the  point  where  the  beam  hits  the 
shore  moving  at  an  instant  when  the  beam  is  lighting  up  a  point  2  miles  along  the  shore  from  the  point  A? 

Exercise  5.1.7  A  baseball  diamond  is  a  square  90  ft  on  a  side.  A  player  runs  from  first  base  to  second 
base  at  15  ft/ sec.  At  what  rate  is  the  player’s  distance  from  third  base  decreasing  when  she  is  halfway 
from  first  to  second  base? 

Exercise  5.1.8  Sand  is  poured  onto  a  surface  at  15  cm3 /sec,  forming  a  conical  pile  whose  base  diameter 
is  always  equal  to  its  altitude.  How  fast  is  the  altitude  of  the  pile  increasing  when  the  pile  is  3  cm  high? 

Exercise  5.1.9  A  boat  is  pulled  in  to  a  dock  by  a  rope  with  one  end  attached  to  the  front  of  the  boat  and 
the  other  end  passing  through  a  ring  attached  to  the  dock  at  a  point  5  ft  higher  than  the  front  of  the  boat. 
The  rope  is  being  pulled  through  the  ring  at  the  rate  of  0.6  ft/sec.  How  fast  is  the  boat  approaching  the 
dock  when  13  ft  of  rope  are  out? 

Exercise  5.1.10  A  balloon  is  at  a  height  of  50  meters,  and  is  rising  at  the  constant  rate  of  5  m/sec.  A 
bicyclist  passes  beneath  it,  traveling  in  a  straight  line  at  the  constant  speed  of  10  m/sec.  How  fast  is  the 
distance  between  the  bicyclist  and  the  balloon  increasing  2  seconds  later? 

Exercise  5.1.11  A  pyramid-shaped  vat  has  square  cross-section  and  stands  on  its  tip.  The  dimensions 
at  the  top  are  2  m  x  2  m,  and  the  depth  is  5  m.  If  water  is  flowing  into  the  vat  at  3  nf’/min,  how  fast 
is  the  water  level  rising  when  the  depth  of  water  (at  the  deepest  point)  is  4  m?  Note:  the  volume  of  any 
“conical”  shape  (including  pyramids)  is  (1/  3)  (height)  (area  of  base). 

Exercise  5.1.12  A  woman  5  ft  tall  walks  at  the  rate  of  3.5  ft/sec  away  from  a  streetlight  that  is  12  ft  above 
the  ground.  At  what  rate  is  the  tip  of  her  shadow  moving?  At  what  rate  is  her  shadow  lengthening? 

Exercise  5.1.13  A  man  1.8  meters  tall  walks  at  the  rate  of  1  meter  per  second  toward  a  streetlight  that 
is  4  meters  above  the  ground.  At  what  rate  is  the  tip  of  his  shadow  moving?  At  what  rate  is  his  shadow 
shortening? 
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Exercise  5.1.14  A  police  helicopter  is  flying  at  150  mph  at  a  constant  altitude  of  0.5  mile  above  a  straight 
road.  The  pilot  uses  radar  to  determine  that  an  oncoming  car  is  at  a  distance  of  exactly  1  mile  from  the 
helicopter,  and  that  this  distance  is  decreasing  at  190  mph.  Find  the  speed  of  the  car. 

Exercise  5.1.15  A  police  helicopter  is  flying  at  200  kilometers  per  hour  at  a  constant  altitude  of  1  km 
above  a  straight  road.  The  pilot  uses  radar  to  determine  that  an  oncoming  car  is  at  a  distance  of  exactly  2 
kilometers  from  the  helicopter,  and  that  this  distance  is  decreasing  at  250  kph.  Find  the  speed  of  the  car. 

Exercise  5.1.16  A  light  shines  from  the  top  of  a  pole  20  m  high.  An  object  is  dropped  from  the  same  height 
from  a  point  10  m  away,  so  that  its  height  at  time  t  seconds  is  h(t )  =  20  —  9.8r/2.  How  fast  is  the  object’s 
shadow  moving  on  the  ground  one  second  later? 


5.2  Extrema  of  a  Function 


In  calculus,  there  is  much  emphasis  placed  on  analyzing  the  behaviour  of  a  function  /  on  an  interval  1. 
Does  /  have  a  maximum  value  on  /?  Does  it  have  a  minimum  value?  How  does  the  interval  /  impact  our 
discussion  of  extrema? 

5.2.1.  Local  Extrema 


A  local  maximum  point  on  a  function  is  a  point  (x,y)  on  the  graph  of  the  function  whose  y  coordinate  is 
larger  than  all  other  y  coordinates  on  the  graph  at  points  “close  to”  (v.y).  More  precisely,  (x,f(x))  is  a 
local  maximum  if  there  is  an  interval  (a,  b)  with  a  <x<b  and  f(x)  >  f{z)  for  every  z  in  (a,  b) .  Similarly, 
(x, yj  is  a  local  minimum  point  if  it  has  locally  the  smallest  y  coordinate.  Again  being  more  precise: 
(jc,/(jc))  is  a  local  minimum  if  there  is  an  interval  (a,b)  with  a  <  x  <  b  and  f(x)  <  f(z)  for  every  z  in 
(a,  b) .  A  local  extremum  is  either  a  local  minimum  or  a  local  maximum. 


Definition  5.7:  Local  Maxima  and  Minima 


A  real-valued  function  f  has  a  local  maximum  at  xq  if  f(x o)  is  the  largest  value  of  f  near  xq;  in 
other  words,  f(x o  >  f(x)  when  x  is  nearx o- 

A  real-valued  function  f  has  a  local  minimum  atx  o  if  f(x  o)  is  the  smallest  value  off  nearx  o;  in 
other  words,  f(xo)  <  f  (x)  when  x  is  nearxo. 


Local  maximum  and  minimum  points  are  quite  distinctive  on  the  graph  of  a  function,  and  are  therefore 
useful  in  understanding  the  shape  of  the  graph.  In  many  applied  problems  we  want  to  find  the  largest  or 
smallest  value  that  a  function  achieves  (for  example,  we  might  want  to  find  the  minimum  cost  at  which 
some  task  can  be  performed)  and  so  identifying  maximum  and  minimum  points  will  be  useful  for  applied 
problems  as  well.  Some  examples  of  local  maximum  and  minimum  points  are  shown  in  Figure  5.5. 


170 


Applications  of  Derivatives 


Figure  5.5:  Some  local  maximum  points  (A)  and  minimum  points  ( B ). 


If  (x,f(x))  is  a  point  where  f(x)  reaches  a  local  maximum  or  minimum,  and  if  the  derivative  of  / 
exists  at  x,  then  the  graph  has  a  tangent  line  and  the  tangent  line  must  be  horizontal.  This  is  important 
enough  to  state  as  a  theorem. 

The  proof  is  simple  enough  and  we  include  it  here,  but  you  may  accept  Fermat’s  Theorem  based  on  its 
strong  intuitive  appeal  and  come  back  to  its  proof  at  a  later  time. 


Proof.  We  shall  give  the  proof  for  the  case  where  f(x)  has  a  local  maximum  at  x  —  a.  The  proof  for  the 
local  minimum  case  is  similar. 

Since  f(x)  has  a  local  maximum  at  x  —  a,  there  is  an  open  interval  (c,d)  with  c  <  a  <  d  and  f(x)  < 
f  (a)  for  every  x  in  (c,d).  So,  /  (x)  —  /  (a)  <  0  for  all  such  x.  Let  us  now  look  at  the  sign  of  the  difference 

quotient  AAZM .  We  consider  two  cases  aeeording  as  *  >  a  or*  < 
x  —  a 

fix )  —  f  (a) 

If  x>  a,  then  x  —  a>  0  and  so, - <  0.  Taking  limit  as  x  approach  a  from  the  right,  we  get 

x  —  a 

lim  <  o. 

x^a+  x  —  a 


fix)  —  f  (a) 

On  the  other  hand,  if  x  <  a,  then  x  —  a  <  0  and  so, - >  0.  Taking  limit  as  x  approach  a  from  the 

x  —  a 

left,  we  get 

fix)—  fia) 

lim  >  0. 

x — yci  X  Cl 

Since  /  is  differentiable  at  a,  f'  (a)  —  lim  -L—l — ■ff1)  _  fix) — f  i°) .  Therefore,  we  have  both 

X 

f  {a)  <  0  and  /'  (a)  >  0.  So,  f'  (a)  =  0. 


x^a+  x  —  a 


x — yci  X  Cl 


Thus,  the  only  points  at  which  a  function  can  have  a  local  maximum  or  minimum  are  points  at  which 
the  derivative  is  zero,  as  in  the  left  hand  graph  in  Figure  5.5,  or  the  derivative  is  undefined,  as  in  the  right 
hand  graph.  Any  value  of  x  in  the  domain  of  /  for  which  f'(x)  is  zero  or  undefined  is  called  a  critical 
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point  for  /.  When  looking  for  local  maximum  and  minimum  points,  you  are  likely  to  make  two  sorts  of 
mistakes:  You  may  forget  that  a  maximum  or  minimum  can  occur  where  the  derivative  does  not  exist,  and 
so  forget  to  check  whether  the  derivative  exists  everywhere.  You  might  also  assume  that  any  place  that 
the  derivative  is  zero  is  a  local  maximum  or  minimum  point,  but  this  is  not  true.  A  portion  of  the  graph  of 
/(x)  =  x3  is  shown  in  Figure  5.6.  The  derivative  of  /  is  f\x)  =  3x2,  and  /'( 0)  =  0,  but  there  is  neither  a 
maximum  nor  minimum  at  (0, 0) . 


Figure  5.6:  No  maximum  or  minimum  even  though  the  derivative  is  zero. 

Since  the  derivative  is  zero  or  undefined  at  both  local  maximum  and  local  minimum  points,  we  need 
a  way  to  determine  which,  if  either,  actually  occurs.  The  most  elementary  approach,  but  one  that  is  often 
tedious  or  difficult,  is  to  test  directly  whether  the  y  coordinates  “near”  the  potential  maximum  or  minimum 
are  above  or  below  the  y  coordinate  at  the  point  of  interest.  Of  course,  there  are  too  many  points  “near” 
the  point  to  test,  but  a  little  thought  shows  we  need  only  test  two  provided  we  know  that  /  is  continuous 
(recall  that  this  means  that  the  graph  of  /  has  no  jumps  or  gaps). 

Suppose,  for  example,  that  we  have  identified  three  points  at  which  f  is  zero  or  nonexistent:  (xi ,  v  | ), 
ix2,yi),  {xT^y-i),  and  x\  <  X2  <  x3  (see  Figure  5.7).  Suppose  that  we  compute  the  value  of  f(a )  for 
x\  <  a  <x  2,  and  that  f(a)  <  f(x  2).  What  can  we  say  about  the  graph  between  a  and  x2?  Could  there  be  a 
point  (b,f(b)),  a  <b  <X2  with  f(b)  >  f(x  2)?  No:  if  there  were,  the  graph  would  go  up  from  ( a,f(a ))  to 
(, b,f(b ))  then  down  to  (x2,/(x2))  and  somewhere  in  between  would  have  a  local  maximum  point.  (This  is 
not  obvious;  it  is  a  result  of  the  Extreme  Value  Theorem.)  But  at  that  local  maximum  point  the  derivative 
of  /  would  be  zero  or  nonexistent,  yet  we  already  know  that  the  derivative  is  zero  or  nonexistent  only  at 
xi,  X2,  and  X3.  The  upshot  is  that  one  computation  tells  us  that  (x2,/(x2))  has  the  largest  y  coordinate  of 
any  point  on  the  graph  near  X2  and  to  the  left  of  X2.  We  can  perform  the  same  test  on  the  right.  If  we  find 
that  on  both  sides  of  X2  the  values  are  smaller,  then  there  must  be  a  local  maximum  at  (x2,/(x2));  if  we 
find  that  on  both  sides  of  X2  the  values  are  larger,  then  there  must  be  a  local  minimum  at  (x2,/(x2));  if  we 
find  one  of  each,  then  there  is  neither  a  local  maximum  or  minimum  at  X2. 


• 

• 

• 

• 

1  1  1 

x\  a  b 

l  l 

x2  x3 

Figure  5.7:  Testing  for  a  maximum  or  minimum. 
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It  is  not  always  easy  to  compute  the  value  of  a  function  at  a  particular  point.  The  task  is  made  easier 
by  the  availability  of  calculators  and  computers,  but  they  have  their  own  drawbacks — they  do  not  always 
allow  us  to  distinguish  between  values  that  are  very  close  together.  Nevertheless,  because  this  method  is 
conceptually  simple  and  sometimes  easy  to  perform,  you  should  always  consider  it. 


Example  5.9:  Simple  Cubic 


Find  all  local  maximum  and  minimum  points  for  the  function  f(x')  —  x3  —x. 


Solution.  The  derivative  is  f  (x)  =  3x2  —  1 .  This  is  defined  everywhere  and  is  zero  at  x  =  ±  v/3  /3 .  Looking 
first  at  x  =  v/3/3,  we  see  that  /(v/ 3/3)  =  — 2\/3/9.  Now  we  test  two  points  on  either  side  of  x  =  v/3/3, 
choosing  one  point  in  the  interval  (—s/3/3,  v/3/3)  and  one  point  in  the  interval  (v/3/3,00)-  Since  /( 0)  = 
0  >  —  2\/3/9  and  /( 1)  =  0  >  — 2v/3/9,  there  must  be  a  local  minimum  at  x  =  x/3/3.  For  x  =  —  v/3/3,  we 
see  that  /(—v/3/3)  =  2v/3/9.  This  time  we  can  use  x  =  0  andx  =  —1,  and  we  find  that  /(— 1)  =  /( 0)  = 
0  <  2V3/9 ,  so  there  must  be  a  local  maximum  at  x  =  —  v/3/3.  X 

Of  course  this  example  is  made  very  simple  by  our  choice  of  points  to  test,  namely  x  =  —  1,  0,  1. 
We  could  have  used  other  values,  say  —5/4,  1/3,  and  3/4,  but  this  would  have  made  the  calculations 
considerably  more  tedious,  and  we  should  always  choose  very  simple  points  to  test  if  we  can. 


Example  5.10:  Max  and  Min 


Find  all  local  maximum  and  minimum  points  forf(x)  =  sinx  +  cosx. 


Solution.  The  derivative  is  f{x)  —  cosx  —  sinx.  This  is  always  defined  and  is  zero  whenever  cosx  =  sinx. 
Recalling  that  the  cosx  and  sinx  are  the  x  and  y  coordinates  of  points  on  a  unit  circle,  we  see  that  cosx  = 
sinx  when  x  is  k/4,  7t/4±7T,  k/4±2k,  k/4±3k,  etc.  Since  both  sine  and  cosine  have  a  period  of  2%,  we 
need  only  determine  the  status  of  x  =  tt/4  and  x  =  57T/4.  We  can  use  0  and  n/ 2  to  test  the  critical  value 
x  —  k/4.  We  find  that  f(i r/4)  =  v/2,  /( 0)  =  1  <  v/2  and  /(tt/2)  —  1,  so  there  is  a  local  maximum  when 
x  —  7l/4  and  also  when  x  —  n/ 4  ±  2 n,  k/4  ±  4 it,  etc.  We  can  summarize  this  more  neatly  by  saying  that 
there  are  local  maxima  at  71/ 4  ±  2kn  for  every  integer  k. 

We  use  it  and  2k  to  test  the  critical  value  x  =  5iz/4.  The  relevant  values  are  /( 5tt/4)  —  —v/2,  f(iz)  — 
—  1  >  —v/2,  /{2k)  —  1  >  —v/2,  so  there  is  a  local  minimum  at  x  =  5tt/4,  57t/4 ±27U,  5k/ 4 ±4k,  etc. 
More  succinctly,  there  are  local  minima  at  5zr/4  ±  2kn  for  every  integer  k.  X 


Example  5.11:  Max  and  Min 


Find  all  local  maximum  and  minimum  points  for  g  (x)  =  x2/3 . 


Solution.  The  derivative  is  g'  (x)  =  |x-1/3.  This  is  undefined  when  x  =  0  and  is  not  equal  to  zero  for  any 
x  in  the  domain  of  g' .  Now  we  test  two  points  on  either  side  of  x  =  0.  We  use  x  =  —  1  and  x  =  1.  Since 
g(0)  —  0,  g(—  1)  =  1  >  0  and  g(l)  =  1  >  0,  there  must  be  a  local  minimum  at  x  =  0.  X 
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Exercises  for  5.2.1 


Find  all  local  maximum  and  minimum  points  (x,y)  by  the  method  of  this  section. 

Exercise  5.2.1  y  —  x2  —  x 

Exercise  5.2.2  y  —  2  +  3x  —  x3 
Exercise  5.2.3  y  —  x3  —  9x2  +  24x 
Exercise  5.2.4  y  —  x4  —  2x2  +  3 
Exercise  5.2.5  y  =  3x4  —  4x3 
Exercise  5.2.6  y  =  (x2  —  \  )/x 
Exercise  5.2.7  y  =  3x2  —  ( 1  /x2) 

Exercise  5.2.8  y  =  cos(2x)  —  x 
Exercise  5.2.9  /(x)  =  x2  —  98x  +  4 

Exercise  5.2.10  For  any  real  number  x  there  is  a  unique  integer  n  such  that  n<x  <n  +  1,  and  the  greatest 
integer  function  is  defined  as  |xj  =  n.  Where  are  the  critical  values  of  the  greatest  integer  function?  Which 
are  local  maxima  and  which  are  local  minima? 


Exercise  5.2.11  Explain  why  the  function  /(x)  =  1/x  has  no  local  maxima  or  minima. 

Exercise  5.2.12  Flow  many  critical  points  can  a  quadratic  polynomial  function  have? 

Exercise  5.2.13  Show  that  a  cubic  polynomial  can  have  at  most  two  critical  points.  Give  examples  to 
show  that  a  cubic  polynomial  can  have  zero,  one,  or  two  critical  points. 

Exercise  5.2.14  Explore  the  family  of  functions  /(x)  =  x3  +  cx+  1  where  c  is  a  constant.  How  many  and 
what  types  of  local  extremes  are  there?  Your  answer  should  depend  on  the  value  of  c,  that  is,  different 
values  ofc  will  give  different  answers. 

Exercise  5.2.15  We  generalize  the  preceding  two  questions.  Let  n  be  a  positive  integer  and  let  f  be  a 
polynomial  of  degree  n.  How  many  critical  points  can  f  have?  (Hint:  Recall  the  Fundamental  Theorem 
of  Algebra,  which  says  that  a  polynomial  of  degree  n  has  at  most  n  roots.) 


174  Applications  of  Derivatives 


5.2.2.  Absolute  Extrema 


Absolute  extrema  are  also  commonly  referred  to  as  global  extrema.  Unlike  local  extrema,  which  are  only 
“extreme"  relative  to  points  “close  to”  them,  an  absolute  (or  global)  extrema  is  “extreme"  relative  to  all 
other  points  in  the  interval  under  consideration. 


Definition  5.12:  Absolute  Maxima  and  Minima 


A  real-valued  function  f  has  an  absolute  maximum  on  an  interval  I  at  xq  if  f(x o)  is  the  largest 
value  of  f  on  I;  in  other  words,  f(x o)  >  f(x)  for  all  x  in  the  domain  of  f  that  are  in  I. 

A  real-valued  function  f  has  an  absolute  minimum  on  an  interval  I  at  xq  if  f(xo)  is  the  smallest 
value  of  f  on  I;  in  other  words,  f(x o)  <  f(x)  for  all  x  in  the  domain  of  f  that  are  in  I. 


Example  5.13:  Absolute  Extrema 


Consider  the  function  f  (x)  —  x2  on  the  interval  (—00,00).  This  parabola  has  an  absolute  minimum 
atx  —  0.  However,  it  does  not  have  an  absolute  maximum. 

Consider  the  function  f(x)  =  |jc|  on  the  interval  [—1,2].  This  graph  looks  like  a  check  mark.  It  has 
an  absolute  minimum  atx  —  0  and  an  absolute  maximum  atx  —  2. 

Consider  the  function  f(x)  =  cos*  on  the  interval  [0,  Jt] .  It  has  an  absolute  minimum  al  x  —  it  and 
an  absolute  maximum  atx  —  0. 

Consider  the  function  f(x)  —  ex  on  any  interval  [a,b],  where  a  <  b.  Since  this  exponential  function 
is  increasing,  it  has  an  absolute  minimum  atx  —  a  and  an  absolute  maximum  atx  —  b. 


Like  Fermat’s  Theorem,  the  following  theorem  has  an  intuitive  appeal.  However,  unlike  Fermat’s 
Theorem,  the  proof  relies  on  a  more  advanced  concept  called  compactness,  which  will  only  be  covered 
in  a  course  typically  entitled  Analysis.  So,  we  will  be  content  with  understanding  the  statement  of  the 
theorem. 


Theorem  5.14:  Extreme- Value  Theorem 


If  a  function  f  is  continuous  on  a  closed  interval  [ a,b ],  then  f  has  both  an  absolute  maximum  and 
an  absolute  minimum  on  [a,  b] . 


Although  this  theorem  tells  us  that  an  absolute  extremum  exists,  it  does  not  tell  us  what  it  is  or  how  to 
find  it. 

Note  that  if  an  absolute  extremum  is  inside  the  interval  (i.e.  not  an  endpoint),  then  it  must  also  be  a 
local  extremum.  This  immediately  tells  us  that  to  find  the  absolute  extrema  of  a  function  on  an  interval, 
we  need  only  examine  the  local  extrema  inside  the  interval,  and  the  endpoints  of  the  interval. 

We  can  devise  a  method  for  finding  absolute  extrema  for  a  function  /  on  a  closed  interval  [a,  b\ : 

1 .  Verify  the  function  is  continuous  on  [a,  b\ . 
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2.  Find  the  derivative  and  determine  all  critical  values  of  /  that  are  in  [a,  b] . 

3.  Evaluate  the  function  at  the  critical  values  found  in  Step  2  and  the  end  points  of  the  interval. 

4.  Identify  the  absolute  extrema. 

Why  must  a  function  be  continuous  on  a  closed  interval  in  order  to  use  this  theorem?  Consider  the 
following  example. 


Solution.  The  function  /  is  not  continuous  at  x  —  0.  Since  0  G  [—  1, 1],  /  is  not  continous  on  the  closed 
interval: 


lim  fix)  —  +°° 

*^0+ 

lim  f(x)  =  — °°, 

x-S’O- 

so  we  are  unable  to  apply  the  Extreme- Value  Theorem.  Therefore,  f(x)  —  1  /x  does  not  have  an  absolute 
maximum  or  an  absolute  minimum  on  [—1, 1].  X 

However,  if  we  consider  the  same  function  on  an  interval  where  it  is  continuous,  the  theorem  will 
apply.  This  is  illustrated  in  the  following  example. 


Example  5.16:  Absolute  Extrema  of  a  1/x 


Find  any  absolute  extrema  forf(x)  —  1  /x  on  the  interval  [1,2]. 


Solution.  The  function  /  is  continous  on  the  interval,  so  we  can  apply  the  Extreme- Value  Theorem.  We 
begin  with  taking  the  derivative  to  be  f'(x)  —  —1/x2  which  has  a  critical  value  at  x  —  0,  but  since  this 
critical  value  is  not  in  [1,2]  we  ignore  it.  The  only  points  where  an  extrema  can  occur  are  the  endpoints 
of  the  interval.  To  find  the  maximum  or  minimum  we  can  simply  evaluate  the  function:  /( 1)  =  1  and 
/( 2)  =  1/2,  so  the  absolute  maximum  is  at  x  —  1  and  the  absolute  minimum  is  at  x  —  2.  X 

Why  must  an  interval  be  closed  in  order  to  use  the  above  theorem?  Recall  the  difference  between  open 
and  closed  intervals.  Consider  a  function  /  on  the  open  interval  (0, 1).  If  we  choose  successive  values  of 
x  moving  closer  and  closer  to  1,  what  happens?  Since  1  is  not  included  in  the  interval  we  will  not  attain 
exactly  the  value  of  1.  Suppose  we  reach  a  value  of  0.9999  —  is  it  possible  to  get  closer  to  1?  Yes:  There 
are  infinitely  many  real  numbers  between  0.9999  and  1.  In  fact,  any  conceivable  real  number  close  to  1 
will  have  infinitely  many  real  numbers  between  itself  and  1.  Now,  suppose  /  is  decreasing  on  (0, 1):  As 
we  approach  1,  /  will  continue  to  decrease,  even  if  the  difference  between  successive  values  of  /  is  slight. 
Similarly  if  /  is  increasing  on  (0, 1). 

Consider  a  few  more  examples: 
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Example  5.17:  Determining  Absolute  Extrema 


Determine  the  absolute  extrema  of  f(x)  —  x3  —  x2  +  !  on  the  interval  [—1,2]. 


Solution.  First,  notice  /  is  continuous  on  the  closed  interval  [—1,2],  so  we’re  able  to  use  Theorem  5.14  to 
determine  the  absolute  extrema.  The  derivative  is  /'( x)  —  3x2  —  2x,  and  the  critical  values  are  x  —  0,2/3 
which  are  both  in  the  interval  [—1,2].  In  order  to  find  the  absolute  extrema,  we  must  consider  all  critical 
values  that  lie  within  the  interval  (that  is,  in  (  —  1,2))  and  the  endpoints  of  the  interval. 

/<-i)  =  (-i)3-(-i)2+i  =  -i 

/(o)  —  (0)3  —  (0)2  +  1  —  1 

/( 2/3)  -  (2/3)3  -  (2/3)2  +  1  =  23/27 
/(2)  =  (2)3  —  (2)2  +1  =  5 

The  absolute  maximum  is  at  (2,5)  and  the  absolute  minimum  is  at  (-1,-1).  £ 


Example  5.18:  Determining  Absolute  Extrema 


Determine  the  absolute  extrema  of  f(x)  —  — 9/x  — x  +  10  on  the  interval  [2,6]. 


Solution.  First,  notice  /  is  continuous  on  the  closed  interval  [2,6],  so  we’re  able  to  use  Theorem  5.14  to 
determine  the  absolute  extrema.  The  function  is  not  continuous  at  x  —  0,  but  we  can  ignore  this  fact  since 
0  is  not  in  [2,6].  The  derivative  is  f'(x)  —  9/x 2  —  1,  and  the  critical  values  are  x  =  ±3,  but  only  x  =  +3  is 
in  the  interval.  In  order  to  find  the  absolute  extrema,  we  must  consider  all  critical  values  that  lie  within  the 
interval  and  the  endpoints  of  the  interval. 

/( 2)  =  -9/(2)  -  (2)  +  10  =  111  =  3.5 
/(3)  =  —9/ (3)  —  (3)  +  10  =  4 
/(6)  =  —9/ (6)  —  (6)  +  10  =  5/2  =  2.5 

The  absolute  maximum  is  at  (3,4)  and  the  absolute  minimum  is  at  (6,2.5).  X 

When  we  are  trying  to  find  the  absolute  extrema  of  a  function  on  an  open  interval,  we  cannot  use 
the  Extreme  Value  Theorem.  However,  if  the  function  is  continuous  on  the  interval,  many  of  the  same 
ideas  apply.  In  particular,  if  an  absolute  extremum  exists,  it  must  also  be  a  local  extremum.  In  addition 
to  checking  values  at  the  local  extrema,  we  must  check  the  behaviour  of  the  function  as  it  approaches  the 
ends  of  the  interval. 

Some  examples  to  illustrate  this  method. 


Solution.  Notice  sec(x)  is  continuous  on  (— 7t/2,7t/2)  and  has  one  local  minimum  at  0.  Also 

lim  sec(x)  =  lim  sec(x)  =  +°°, 

*-►(— jt/2)+  x-t(x/ 2)~ 
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so  sec(x)  has  no  absolute  maximum,  but  the  point  (0, 1)  is  the  absolute  minimum.  X 

A  similar  approach  can  be  used  for  infinite  intervals. 


Solution.  Since  x~  +  1  ^  0  for  all  x  in  (—  oo5oo)  the  function  is  continuous  on  this  interval.  This  function 
has  only  one  critical  value  at  x  —  0,  which  is  the  local  minimum  and  also  the  absolute  minimum.  Now, 
limx_j.-too  -f—  =  1,  so  the  function  does  not  have  an  absolute  maximum:  It  continues  to  increase  towards 

■A  I  1 

1,  but  does  not  attain  this  exact  value. 


Exercises  for  5.2.2 


Exercise  5.2.16  Find  the  absolute  extrema  for  f(x)  —  —  on  [0,3]. 

Exercise  5.2.17  Find  the  absolute  extrema  for  f(x)  =  —  ^4  on  [0,3]. 

Exercise  5.2.18  Find  the  absolute  extrema  for  f  {x)  —  csc(v)  on  [0,  Jl\. 

Exercise  5.2.19  Find  the  absolute  extrema  for  f(x)  —  I  n  (x) /x1  on  [1,4]. 

Exercise  5.2.20  Find  the  absolute  extrema  for  f(x)  —  x\J  1  —  x2  on  [—1,1]. 

Exercise  5.2.21  Find  the  absolute  extrema  for  f(x)  —  xe  A~/32  on  [0,2]. 

Exercise  5.2.22  Find  the  absolute  extrema  for  f(x)  — tan  '(2x)  on  [0,2]. 

Exercise  5.2.23  Find  the  absolute  extrema  for  f{x)  —  ^ffj- 

Exercise  5.2.24  For  each  of  the  following,  sketch  a  potential  graph  of  a  continuous  function  on  the  closed 
interval  [0,4]  with  the  given  properties. 

(a)  Absolute  minimum  at  0,  absolute  maximum  at  2,  local  minimum  at  3. 

(b)  Absolute  maximum  at  1,  absolute  minimum  at  2,  local  maximum  at  3. 

(c)  Absolute  minimum  at  4,  absolute  maximum  at  1,  local  minimum  at  2,  local  maxima  at  1  and  3. 
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5.3  The  Mean  Value  Theorem 


There  are  numerous  applications  of  the  derivative  through  its  definition  as  rate  of  change  and  as  the  slope 
of  the  tangent  line.  In  this  section  we  shall  look  at  some  deeper  reasons  why  the  derivative  turns  out  to  be 
so  useful.  The  simple  answer  is  that  the  derivative  of  a  function  tells  us  a  lot  about  the  function.  More 
important,  “hard”  questions  about  a  function  can  sometimes  be  answered  by  solving  a  relatively  simple 
problem  about  the  derivative  of  the  function. 


The  Mean  Value  Theorem  tells  us  that  there  is  an  intimate  connection  between  the  net  change  of  the 
value  of  any  “sufficiently  nice”  function  over  an  interval  and  the  possible  values  of  its  derivative  on  that 
interval.  Because  of  this  connection,  we  can  draw  conclusions  about  the  possible  values  of  the  derivative 
based  on  information  about  the  values  of  the  function,  and  conversely,  we  can  draw  conclusions  about  the 
values  of  the  function  based  on  information  about  the  values  of  its  derivative. 


Value 

Theorem 


Let  us  illustrate  the  idea  through  the  following  two  interesting  questions  involving  derivatives: 

1.  Suppose  two  different  functions  have  the  same  derivative;  what  can  you  say  about  the  relationship 
between  the  two  functions? 
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2.  Suppose  you  drive  a  car  from  toll  booth  on  a  toll  road  to  another  toll  booth  at  an  average  speed  of 
70  miles  per  hour.  What  can  be  concluded  about  your  actual  speed  during  the  trip?  In  particular,  did 
you  exceed  the  65  mile  per  hour  speed  limit? 

While  these  sound  very  different,  it  turns  out  that  the  two  problems  are  very  closely  related.  We  know 
that  “speed”  is  really  the  derivative  by  a  different  name;  let’s  start  by  translating  the  second  question  into 
something  that  may  be  easier  to  visualize.  Suppose  that  the  function  f(t)  gives  the  position  of  your  car  on 
the  toll  road  at  time  t .  Your  change  in  position  between  one  toll  booth  and  the  next  is  given  by  f{t\ )  —  /(to), 
assuming  that  at  time  to  you  were  at  the  first  booth  and  at  time  t\  you  arrived  at  the  second  booth.  Your 
average  speed  for  the  trip  is  {f{t\ )  —  /(to)  )/(j\  —  k))-  If  we  think  about  the  graph  of  fit),  the  average  speed 
is  the  slope  of  the  line  that  connects  the  two  points  (to, /(h)) )  and  (ti,/(ti)).  Your  speed  at  any  particular 
time  t  between  to  and  t\  is  /'(f),  the  slope  of  the  curve.  Now  question  (2)  becomes  a  question  about  slope. 
In  particular,  if  the  slope  between  endpoints  is  70,  what  can  be  said  of  the  slopes  at  points  between  the 
endpoints? 

As  a  general  rule,  when  faced  with  a  new  problem  it  is  often  a  good  idea  to  examine  one  or  more 
simplified  versions  of  the  problem,  in  the  hope  that  this  will  lead  to  an  understanding  of  the  original 
problem.  In  this  case,  the  problem  in  its  “slope”  form  is  somewhat  easier  to  simplify  than  the  original,  but 
equivalent,  problem. 

Here  is  a  special  instance  of  the  problem.  Suppose  that  /(to)  =  /(fi).  Then  the  two  endpoints  have 
the  same  height  and  the  slope  of  the  line  connecting  the  endpoints  is  zero.  What  can  we  say  about  the 
slope  between  the  endpoints?  It  shouldn’t  take  much  experimentation  before  you  are  convinced  of  the 
truth  of  this  statement:  Somewhere  between  to  and  t\  the  slope  is  exactly  zero,  that  is,  somewhere  between 
to  and  t\  the  slope  is  equal  to  the  slope  of  the  line  between  the  endpoints.  This  suggests  that  perhaps  the 
same  is  true  even  if  the  endpoints  are  at  different  heights,  and  again  a  bit  of  experimentation  will  probably 
convince  you  that  this  is  so.  But  we  can  do  better  than  “experimentation” — we  can  prove  that  this  is  so. 

We  start  with  the  simplified  version: 


Theorem  5.21:  Rolle’s  Theorem 


Suppose  that  f(x)  has  a  derivative  on  the  interval  ( a,b ),  is  continuous  on  the  interval  [a,b],  and 
f(a )  —  f{b).  Then  at  some  value  c  G  (a,b),  f{c)  =  0. 


Proof.  We  know  that  f(x)  has  a  maximum  and  minimum  value  on  [a,  b]  (because  it  is  continuous),  and  we 
also  know  that  the  maximum  and  minimum  must  occur  at  an  endpoint,  at  a  point  at  which  the  derivative  is 
zero,  or  at  a  point  where  the  derivative  is  undefined.  Since  the  derivative  is  never  undefined,  that  possibility 
is  removed. 

If  the  maximum  or  minimum  occurs  at  a  point  c,  other  than  an  endpoint,  where  /'(c)  =  0,  then  we 
have  found  the  point  we  seek.  Otherwise,  the  maximum  and  minimum  both  occur  at  an  endpoint,  and 
since  the  endpoints  have  the  same  height,  the  maximum  and  minimum  are  the  same.  This  means  that 
f(x)  —  f(a )  =  f(b)  at  every  x  G  [a,b\,  so  the  function  is  a  horizontal  line,  and  it  has  derivative  zero 
everywhere  in  (a,  b) .  Then  we  may  choose  any  c  at  all  to  get  /'(c)  =0.  £ 

Rolle’s  Theorem  is  illustrated  below  for  a  function  f(x)  where  f'(x)  —  0  holds  for  two  values  of  x  —  c\ 
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andx  =  C2'- 


Perhaps  remarkably,  this  special  case  is  all  we  need  to  prove  the  more  general  one  as  well. 


Theorem  5.22:  Mean  Value  Theorem 


Suppose  that  fix)  has  a  derivative  on  the  interval  (a,  b )  and  is  continuous  on  the  interval  [a,  b] .  Then 

,  ,  ^  t  u\  I  \ 

at  some  value  c  G  (a,b),  f  (c)  = - - - . 


r  ( /■/  \ 

Proof.  Let  m  = - - - ,  and  consider  a  new  function  g(x)  =  /(x)  —  m(x  —  a)  —  f(a ) .  We  know  that 

b  —  a 

g(x)  has  a  derivative  everywhere,  since  g'(x)  —  f'(x )  —  m.  We  can  compute  g(a)  —  f(a)  —  m(a  —  a)  — 
f(a )  =  0  and 

gib)  =  f{b)  -  m(b  -a)-  f(a)  =  f{b )  -  f{b)hZ  fa{a)  (b  -a)-  f(a ) 

=  fib)  -  ifib)  -  f{a ))  -  f(a)  =  0. 


So  the  height  of  g(x)  is  the  same  at  both  endpoints.  This  means,  by  Rolle’s  Theorem,  that  at  some  c, 
g'(c)  —  0.  But  we  know  that  g'(c)  —  f'{c)  —  m,  so 


0  =  fic)-m  =  fic) 


fjb)-fja) 

b  —  a 


which  turns  into 

/(c) = mxm, 

b  —  a 

exactly  what  we  want. 

The  Mean  Value  Theorem  is  illustrated  below  showing  the  existence  of  a  point  x  —  c  for  a  function 
fix)  where  the  tangent  line  at  x  =  c  (with  slope  /'(c))  is  parallel  to  the  secant  line  connecting  A(a,f(a)) 
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and  B(b,f(b ))  (with  slope  ): 


Returning  to  the  original  formulation  of  question  (2),  we  see  that  if  /(f)  gives  the  position  of  your  car 
at  time  t,  then  the  Mean  Value  Theorem  says  that  at  some  time  c,  f'(c )  =  70,  that  is,  at  some  time  you 
must  have  been  traveling  at  exactly  your  average  speed  for  the  trip,  and  that  indeed  you  exceeded  the  speed 
limit. 

Now  let’s  return  to  question  (1).  Suppose,  for  example,  that  two  functions  are  known  to  have  derivative 
equal  to  5  everywhere,  f'(x)  =  g'(x)  =  5.  It  is  easy  to  find  such  functions:  5x,  5x  +  47,  5x—  132,  etc.  Are 
there  other,  more  complicated,  examples?  No — the  only  functions  that  work  are  the  “obvious”  ones, 
namely,  5x  plus  some  constant.  How  can  we  see  that  this  is  true? 

Although  “5”  is  a  very  simple  derivative,  let’s  look  at  an  even  simpler  one.  Suppose  that  f'(x)  — 
g'(x)  —  0.  Again  we  can  find  examples:  f(x)  —  0,  f(x)  —  47,  f(x)  =  —511  all  have  f'(x)  —  0.  Are  there 
non-constant  functions  /  with  derivative  0?  No,  and  here’s  why:  Suppose  that  f(x)  is  not  a  constant 
function.  This  means  that  there  are  two  points  on  the  function  with  different  heights,  say  f(o )  /  f{b). 
The  Mean  Value  Theorem  tells  us  that  at  some  point  c,  f'(c )  =  {fib)  —  f(a ))  /{b  — a)  0.  So  any  non¬ 
constant  function  does  not  have  a  derivative  that  is  zero  everywhere;  this  is  the  same  as  saying  that  the 
only  functions  with  zero  derivative  are  the  constant  functions. 

Let’s  go  back  to  the  slightly  less  easy  example:  suppose  that  f'(x)  —  g'(x)  —  5.  Then  (/(jc)  —  g(x))'  = 
f'(x)  —  g'(x)  —  5  —  5  =  0.  So  using  what  we  discovered  in  the  previous  paragraph,  we  know  that  fix)  — 
g(x)  =  k,  for  some  constant  k.  So  any  two  functions  with  derivative  5  must  differ  by  a  constant;  since  5x 
is  known  to  work,  the  only  other  examples  must  look  like  5x  +  k. 

Now  we  can  extend  this  to  more  complicated  functions,  without  any  extra  work.  Suppose  that  f{x)  — 
g'(x).  Then  as  before  (/(jc)  —  g(x))'  —  f{x)  —  g'(x)  —  0,  so  f(x)  —  g(x)  =  k.  Again  this  means  that  if  we 
find  just  a  single  function  g(x)  with  a  certain  derivative,  then  every  other  function  with  the  same  derivative 
must  be  of  the  form  g(x)  +  k. 


Example  5.23:  Given  Derivative 


Describe  all  functions  that  have  derivative  5x  —  3 . 


Solution.  It’s  easy  to  find  one:  g(;c)  =  (5/2)x2  —  3x  has  g'(x)  —  5x  —  3.  The  only  other  functions  with  the 
same  derivative  are  therefore  of  the  form  f(x)  —  (5 /2)x2  —  3x  +  k. 

Alternately,  though  not  obviously,  you  might  have  first  noticed  that  g(x)  —  (5/2) x2  -3x4-41  has 
g'(x)  —  5x  —  3 .  Then  every  other  function  with  the  same  derivative  must  have  the  form  f(x)  —  (5/2)j c2  — 
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3x  +  47  +  k.  This  looks  different,  but  it  really  isn’t.  The  functions  of  the  form  /(x)  =  (5/2)x2  —  3x  +-  k  are 
exactly  the  same  as  the  ones  of  the  form  /(x)  =  (5/2)x2  —  3x  +  47  +  k.  For  example,  (5/2)x2  —  3x+  10  is 
the  same  as  (5/2)x2  —  3x  +  47  +  (—37),  and  the  first  is  of  the  first  form  while  the  second  has  the  second 
form. 

This  is  worth  calling  a  theorem: 


Theorem  5.24:  Functions  with  the  Same  Derivative 


Iff'ix )  —  g\x)  for  every  x  e  ( a,b ),  thenforsome  constantk,  f(x)  —  g(x)  +  k  on  the  interval  (a,  b). 


Example  5.25:  Same  Derivative 


Describe  all  functions  with  derivative  sinx  +  ex .  One  such  function  is  —  cosx  +  ex,  so  all  such 
functions  have  the  form  —  cosx  +  ex  +  k. 


Theorem  5.24  and  the  above  example  illustrate  what  the  Mean  Value  Theorem  allows  us  to  say  about 
/(x)  when  we  have  perfect  information  about  f'(x).  Specifically,  /(x)  is  determined  up  to  a  constant.  Our 
next  example  illustrates  almost  the  opposite  extreme  situation,  one  where  we  have  much  less  information 
about  /'(x)  beyond  the  fact  that  f'(x)  exists.  Specifically,  assuming  that  we  know  an  upper  bound  on  the 
values  of  /'(x),  what  can  we  say  about  the  values  of  /(x)? 


Example  5.26:  Conclusion  Regarding  Function  Value  Based  on  Derivative  Information 


Suppose  that  f  is  a  differentiable  function  such  that  f  (x)  <  2  for  all  x.  What  is  the  largest  possible 
value  off  (7)  iff( 3)  =  5? 


Solution.  We  are  interested  in  the  values  of  /  (x)  at  x  =  3  and  x  =  7.  It  makes  sense  to  focus  our  attention 
on  the  interval  between  3  and  7.  It  is  given  that  /(x)  is  differentiable  for  all  x.  So,  /(x)  is  also  continuous 
at  all  x.  In  particular,  /(x)  is  continuous  on  the  interval  [3,7]  and  differentiable  on  the  interval  (3,7) .  By 
the  Mean  Value  Theorem,  we  know  that  there  is  some  c  in  (3,7)  such  that 


f(c) 


/(?)-/ (3) 

7-3 


Simplifying  and  using  the  given  information  /  (3)  =  5,  we  get 


f(c) 


/(7)~5 

4 


or,  after  re-arranging  the  terms, 

/(7)  =  4/'(c)  +  5. 

We  do  not  know  the  exact  value  of  c,  but  we  do  know  that  f  (x)  <  2  for  all  x.  This  implies  that  f  (c)  <  2. 
Therefore, 


/ (7)  <4-2  +  5=  13. 
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That  is,  the  value  of  /(7)  cannot  exceed  13.  To  convince  ourselves  that  13  (as  opposed  to  some  smaller 
number)  is  the  largest  possible  value  of  /  (7) ,  we  still  need  to  show  that  it  is  possible  for  the  value  of  /  (7) 
to  reach  13.  If  we  review  our  proof,  we  notice  that  the  inequality  will  be  an  equality  if  f  (c)  =  2.  One 
way  to  guarantee  this  without  knowing  anything  about  c  is  to  require  f  (x)  =  2  for  all  x.  This  means  that 
/  (x)  —  2 x  +  k  for  some  constant  k.  From  the  condition  /  (3)  =  5,  we  see  that  k  =  —  1 .  We  can  easily  verify 
that  indeed  /  (x)  =  2x  —  1  meets  all  our  requirements  and  /  (7)  =  13.  X 


Exercises  for  Section  5.3 


Exercise  5.3.1  Let  f{x)  —  x2.  Find  a  value  c  E  (—1,2)  so  that  f(c)  equals  the  slope  between  the  endpoints 
of  f(x)  on  [-1,2]. 

Exercise  5.3.2  Verify  that  /(x)  =  x/  (x  +  2)  satisfies  the  hypotheses  of  the  Mean  Value  Theorem  on  the 
interval  [1,4]  and  then  find  all  of  the  values,  c,  that  satisfy  the  conclusion  of  the  theorem. 

Exercise  5.3.3  Verify  that  /(x)  =  3x/  (x  +  7)  satisfies  the  hypotheses  of  the  Mean  Value  Theorem  on  the 
interval  [—2,6]  and  then  find  all  of  the  values,  c,  that  satisfy  the  conclusion  of  the  theorem. 


Exercise  5.3.4  Let  /(x)  =  tanx.  Show  that  fin)  =  f(2n)  =  0  but  there  is  no  number  c  E  (k,  2k)  such  that 
f\c)  —  0.  Why  does  this  not  contradict  Rolle’s  theorem? 


Exercise  5.3.5  Let  f(x)  —  (x  —  3)  2.  Show  that  there  is  no  value  c  E  (1,4)  such  that  f'(c )  =  (/( 4)  — 
/( 1 ) )  /  (4  —  1 ) .  Why  is  this  not  a  contradiction  of  the  Mean  Value  Theorem  ? 

Exercise  5.3.6  Describe  all  functions  with  derivative  x  +47x  —  5. 

Exercise  5.3.7  Describe  all  functions  with  derivative - T. 

1  +X" 

3  1 

Exercise  5.3.8  Describe  cdl  functions  with  derivative  x - . 


x 

Exercise  5.3.9  Describe  cdl  functions  with  derivative  sin(2x). 

Exercise  5.3.10  Find  f(x)  iff  (x)  =  e~x  and  f{  0)  =  2. 

Exercise  5.3.11  Suppose  that  f  is  a  differentiable  function  such  that  f  (x)  >  —3  for  all  x.  What  is  the 
smallest  possible  value  of  f  { 4)  iff(  —  1)  —2? 

Exercise  5.3.12  Show  that  the  equation  6x4  -  lx  +1=0  does  not  have  more  than  two  distinct  real  roots. 

Exercise  5.3.13  Let  f  be  differentiable  on  R.  Suppose  that  f'  (x)  f  Ofor  every  x.  Prove  that  f  has  at  most 
one  real  root. 
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Exercise  5.3.14  Prove  that  for  all  real  x  and  y  |cosx  —  cosy|  <  \x  —  y|.  State  and  prove  an  analogous 
result  involving  sine. 

Exercise  5.3.15  Show  that  \/l  +  x  <  1  +  (x/2)  if —  l  <x  <  1. 

Exercise  5.3.16  Suppose  that  f(a)  =  g(a)  and  that  f'{x)  <  g'(x)  for  all  x>  a. 

(a)  Prove  that  fix)  <  g(x)  for  all  x  >  a. 

(b)  Use  part  (a)  to  prove  that  ex  >  1  +xfor  all  x  >  0. 

x2 

(c)  Use  parts  (a)  and  (b)  to  prove  that  ex  >  1  +x  +  —  for  all  x  >  0. 

(d)  Can  you  generalize  these  results? 


5.4  Linear  and  Higher  Order  Approximations 


When  we  define  the  derivative  f  (x)  as  the  rate  of  change  of  /  (x)  with  respect  to  x,  we  notice  that  in 
relation  to  the  graph  of  /,  the  derivative  is  the  slope  of  the  tangent  line,  which  (loosely  speaking)  is 
the  line  that  just  grazes  the  graph.  But  what  precisely  do  we  mean  by  this?  In  short,  the  tangent  line 
approximates  the  graph  near  the  point  of  contact.  The  definition  of  the  derivative  /'  (a)  guarantees  this 
when  it  exists:  By  taking  x  sufficiently  close  to  a  but  not  equal  to  a, 

f(x)-f(a)  K/(a)- 

x  —  a 

and  consequently, 

f{x)  «/(a)  (x  — a)  +/  (a) . 

The  left  hand  side  gives  us  the  y-value  of  the  function  y  —  f  (x)  and  the  right  hand  side  gives  us  the  y-value 
y  =  f  (a)  (x  — a) +f  (a)  for  the  tangent  line  to  the  graph  of  /  at  the  point  ( a,f{a )) . 

In  this  section  we  will  explore  how  to  apply  this  idea  to  approximate  some  values  of  /,  some  changes 
in  the  values  of  /,  and  also  the  roots  of  /. 

5.4.1.  Linear  Approximations 


We  begin  by  the  first  derivative  as  an  application  of  the  tangent  line  to  approximate  /. 

Recall  that  the  tangent  line  to  /(x)  at  a  point  x  —  a  is  given  by 

L{x)  =  f' (a)  (x  —  a)  +f(a). 

The  tangent  line  in  this  context  is  also  called  the  linear  approximation  to  /  at  a. 

If  /  is  differentiable  at  a  then  L  is  a  good  approximation  of  /  so  long  as  x  is  “not  too  far”  from  a. 
Put  another  way,  if  /  is  differentiable  at  a  then  under  a  microscope  /  will  look  very  much  like  a  straight 
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line,  and  thus  will  look  very  much  like  L;  since  L(x)  is  often  much  easier  to  compute  than  f(x),  then  it 
makes  sense  to  use  L  as  an  approximation.  Figure  5.8  shows  a  tangent  line  to  y  —  x~  at  three  different 
magnifications. 


Figure  5.8:  The  linear  approximation  to  y  —  x2. 


Thus  in  practice  if  we  want  to  approximate  a  difficult  value  of  f(b),  then  we  may  be  able  to  approxi¬ 
mate  this  value  using  a  linear  approximation,  provided  that  we  can  compute  the  tangent  line  at  some  point 
a  close  to  b.  Here  is  an  example. 


Solution.  We  are  asked  to  calculate  /( 6)  =  i/6  +  4  =  \/l0  which  is  not  easy  to  do  without  a  calculator. 
However  9  is  (relatively)  close  to  10  and  of  course  /( 5)  =  y/9  is  easy  to  compute,  and  we  use  this  to 
approximate  \/l0. 

To  do  so  we  have  f'(x)  =  1  / (2\/x  +  4),  and  thus  the  linear  approximation  to  /  at  x  —  5  is 

L(x)  =  (vfri)  {x ~ 5) + '/j:r = x~r + 3- 

Now  to  estimate  \/l0,  we  substitute  6  into  the  linear  approximation  L(x)  instead  of  f(x),  to  obtain 

V6  +  4  «  +  3  =  ^  =  3i/6  =  3.16  «  3.17 

6  6 

It  turns  out  the  exact  value  of  \/T0  is  actually  3. 16227766. . .  but  our  estimate  of  3. 17  was  very  easy  to 
obtain  and  is  relatively  accurate.  This  estimate  is  only  accurate  to  one  decimal  place.  X 

With  modern  calculators  and  computing  software  it  may  not  appear  necessary  to  use  linear  approxima¬ 
tions,  but  in  fact  they  are  quite  useful.  For  example  in  cases  requiring  an  explicit  numerical  approximation, 
they  allow  us  to  get  a  quick  estimate  which  can  be  used  as  a  “reality  check”  on  a  more  complex  calculation. 
Further  in  some  complex  calculations  involving  functions,  the  linear  approximation  makes  an  otherwise 
intractable  calculation  possible  without  serious  loss  of  accuracy. 
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Example  5.28:  Linear  Approximation  of  Sine 


Find  the  linear  approximation  of  sinx  atx  —  0,  and  use  it  to  compute  small  values  of  sinx. 

Solution.  If  /(x)  —  sinx,  then  f'(x)  —  cosx,  and  thus  the  linear  approximation  of  sinx  at  x  =  0  is: 

L(x)  =  cos(O)  (x  —  0)  +  sin(0)  =  x. 

Thus  when  x  is  small  this  is  quite  a  good  approximation  and  is  used  frequently  by  engineers  and  scientists 
to  simplify  some  calculations. 

For  example  you  can  use  your  calculator  (in  radian  mode  since  the  derivative  of  sinx  is  cosx  only  in 
radian)  to  see  that 

sin(O.l)  =  0.099833416. . . 

and  thus  L(0.1)  =  0.1  is  a  very  good  and  quick  approximation  without  any  calculator!  X 

Exercises  for  5.4.1 


Exercise  5.4.1  Find  the  linearization  L(x)  of  /(x)  =  ln(l  +x)  at  a  —  0.  Use  this  linearization  to  approxi¬ 
mate  /( 0.1). 

Exercise  5.4.2  Use  linear  approximation  to  estimate  (1.9)3. 

Exercise  5.4.3  Show  in  detail  that  the  linear  approximation  of  sinx  at  x  —  0  is  L(x)  =  x  and  the  linear 
approximation  of  cosx  at  x  —  0  is  L(x)  =  1. 

Exercise  5.4.4  Use  /(x)  =  f/x  +  1  to  approximate  \/9  by  choosing  an  appropriate  point  x  —  a.  Are  we 
over-  or  under-estimating  the  value  of  \J 9?  Explain. 

5.4.2.  Differentials 


Very  much  related  to  linear  approximations  are  the  differentials  dx  and  dy,  used  not  to  approximate  values 
of  /,  but  instead  the  change  (or  rise)  in  the  values  of  /. 


Definition  5.29:  Differentials  dx  and  dy 


Let  y  =  /(x)  be  a  differentiable  function.  We  define  a  new  independent  variable  dx,  and  a  new 
dependent  variable  dy  —  f  (x)  dx.  Notice  thatdy  is  a  function  both  of  x  (since  f'ix)  is  a  function  of 
x)  and  of  dx.  We  call  both  dx  and  dy  differentials. 


Now  fix  a  point  a  and  let  A x  =  x  —  a  and  Ay 
we  set  dx  —  Ax  then  we  obtain 


/(x)  —  f(a) .  If  x  is  near  a  then  Ax  is  clearly  small.  If 

Ay, 


dy  —  f' (a )  dx  ~  —  Ax  —  Ay. 
Ax 
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Thus,  dy  can  be  used  to  approximate  Ay,  the  actual  change  in  the  function  /  between  a  and  x.  This  is 
exactly  the  approximation  given  by  the  tangent  line: 

dy  =  f  (a)  (x  -  a)  =  f  (a)  (x  -  a)  +  f{a)  -  f{a)  =  L(x)  -  f(a) . 

While  L(x)  approximates  fix),  dy  approximates  how  fix)  has  changed  from  f(a).  Figure  5.9  illustrates 
the  relationships. 


a  x 

Figure  5.9:  Differentials. 


Here  is  a  concrete  example. 


Example  5.30:  Rise  of  Natural  Logarithm 


Approximate  the  rise  of  fix)  —  \nx  from  x  —  1  tox—  1.1,  using  linear  approximation. 


Solution.  Note  that  ln(l.l)  is  not  readily  calculated  (without  a  calculator)  hence  why  we  wish  to  use  linear 
approximation  to  approximate  /( 1 . 1 )  —  /( 1 ) . 

We  fix  <a  —  l  and  as  above  we  have  Ax  —  x  —  1  and  Ay  =  f{x)  —  /( 1)  =  lnx,  and  obtain 

dy  —  f'{\)dx  ~  =  Ay. 

But  f' (x)  —  \/x  and  thus  /'( 1)  =  1/1  =  1,  we  obtain  in  this  case 

dy  —  r/x  ~  Ay. 

Finally  for  x  =  1.1,  we  can  easily  approximate  the  rise  of  /  as 

/(1-1)— /(l)  =  Ay  &  dy  =  1.1  —  1  =  0.1. 


The  correct  value  of  ln(l.l)  =  In  1  is  0.0953. . .  and  thus  we  were  relatively  close. 


X 
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Exercises  for  5.4.2 


Exercise  5.4.5  Let  f(x)  —  x4.  If  a  —  1  and  dx  —  Ax  —  1/2,  what  are  Ay  and  dy? 

Exercise  5.4.6  Let  f(x)  —  yfx.  If  a  =  1  and  dx  —  Ax  —  1/ 10,  what  are  Ay  and  dy? 

Exercise  5.4.7  Let  f(x)  —  sin(2x).  If  a  —  %  and  dx  —  Ax  —  nf  100,  what  are  Ay  and  dy? 

Exercise  5.4.8  Use  differentials  to  estimate  the  amount  of  paint  needed  to  apply  a  coat  of  paint  0.02  cm 
thick  to  a  sphere  with  diameter  40  meters.  (Recall  that  the  volume  of  a  sphere  of  radius  r  is  V  —  (4/3)71 r\ 
Notice  that  you  are  given  that  dr  —  0.02.) 

5.4.3.  Taylor  Polynomials 


We  can  go  beyond  first  order  derivatives  to  create  polynomials  approximating  a  function  as  closely  as  we 
wish,  these  are  called  Taylor  Polynomials. 

While  our  linear  approximation  L(x)  —  f  (a)  (x  —  a)+  f(a)  at  a  point  a  was  a  polynomial  of  degree  1 
such  that  both  L(a)  —  f(a)  and  if  (a)  —  f'(a),  we  can  now  form  a  polynomial 

Tn(x)  =  ao  +  ai(x  —  a)  +  aj(x  —  a)2  -\- a?,(x  —  a)3  - b  an(x  —  a)n 


which  has  the  same  first  n  derivatives  at  x  =  a  as  the  function  /. 
By  successively  computing  the  derivatives  of  Tn,  we  obtain: 


ao  =  f{a) 


a  i 

ai 


/'(q) 

2! 


f{a) 

0! 


ak  = 
an 


k\ 


where  f^k\x)  is  the  kth  derivative  of  f(x),  and  n\  —  n(n—  l)(n  —  2) . . .  (2) ( 1),  referred  to  as  factorial 
notation. 

Here  is  an  example. 


Example  5.31:  Approximate  e  using  Taylor  Polynomials 


Approximate  e*  using  Taylor  polynomials  at  a  —  0,  and  use  this  to  approximate  e. 


Solution.  In  this  case  we  use  the  function  f(x)  —  ex  at  a  =  0,  and  therefore 

Tn (x)  =  ciq  +  a\x  +  axx 2  +  a^x3  +  . . .  +  anAl 
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Since  all  derivatives  / ^  (x)  =  ex,  we  get: 


a0 

a  i  = 
02  = 
as  = 


m  =  i 
/( 0)  _  1 
1!  ~  1 
f"( 0)  _  1 
2!  —  2! 
/"( 0)  _  1 
3!  —  3! 


Uk  = 


f(k\ 0) 

A-! 


/(n)(0) 

n\ 


n! 


Thus 


and  in  general 


T\  (x)  =  1  +x  —  L(x) 

Tl  (x)  =  1  +  X  +  yj- 
Ts(x)  —  1  + x  +  +  fy 


Tn(x) 


2  3 

1+J:+2!  +  3! 


+  •••  + 


x" 

n! 


Finally  we  can  approximate  e  —  /(l)  by  simply  calculating  7J,(1).  A  few  values  are: 


7i(l)  =  l  +  l=2 
72(1)  =  1  +  1  +  |  =  2.5 

74.(1)  =  l  +  l  +  ^y  +  yj-  =  2.6 

r8(l)  =  2.71825396825 
T20(l)  =2.71828182845 


e, 


We  can  continue  this  way  for  larger  values  of  n,  but  Tij)  ( 1 )  is  already  a  pretty  good  approximation  of 
and  we  took  only  20  terms!  £ 


Exercises  for  5.4.3 


Exercise  5.4.9  Find  the  5th  degree  Taylor  polynomial  for  /(x)  =  sinx  around  a  —  0. 

(a)  Use  this  Taylor  polynomial  to  approximate  sin(O.l). 

( b)  Use  a  calculator  to  find  sin(O.l).  How  does  this  compare  to  our  approximation  in  part  (a)? 


Exercise  5.4.10  Suppose  that  f"  exists  and  is  continuous  on  [1,2].  Suppose  also  that  \  f" (x)  <  ^  for  all 
x  in  (1,2).  Prove  that  if  we  use  the  linearization  y  —  L(x)  ofy  —  /(x)  at  x  —  1  as  an  approximation  of 
y  =  /(x)  near  x  =  1,  then  our  estimated  value  off(  1 .2)  is  guaranteed  to  have  an  accuracy  of  at  least  0. 01, 
i.e.,  our  estimate  will  lie  within  0.01  units  of  the  true  value. 
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Exercise  5.4.11  Find  the  3rd  degree  Taylor  polynomial  for  f{x)  —  y—  —  1  around  a  —  0.  Explain  why  this 
approximation  would  not  be  useful  for  calculating  /( 5). 

Exercise  5.4.12  Consider  fix)  —  lnx  around  a—\. 

(a)  Find  a  general  formula  for  f{n]  (x)  for  n  >  1. 

(b)  Find  a  general  formula  for  the  Taylor  Polynomial,  Tn(x). 


5.4.4.  Newton’s  Method 


A  well  known  numeric  method  is  Newton’s  Method  (also  sometimes  referred  to  as  Newton-Raphson’s 
Method),  named  after  Isaac  Newton  and  Joseph  Raphson.  This  method  is  used  to  find  roots,  or  x-intercepts, 
of  a  function.  While  we  may  be  able  to  find  the  roots  of  a  polynomial  which  we  can  easily  factor,  we  saw 
in  the  previous  chapter  on  Limits,  that  for  example  the  function  ex  +  x  —  0  has  a  solution  (i.e.  root,  or 
x-intercept)  at  x  ~  —0.56714.  By  the  Intermediate  Value  Theorem  we  know  that  the  function  ex  +x  —  0 
does  have  a  solution.  We  cannot  here  simply  solve  for  such  a  root  algebraically,  but  we  can  use  a  numerical 
method  such  as  Newton's.  Such  a  process  is  typically  classified  as  an  iterative  method,  a  name  given  to 
a  technique  which  involves  repeating  similar  steps  until  the  desired  accuracy  is  obtained.  Many  computer 
algorithms  are  coded  with  a  for-loop,  repeating  an  iterative  step  to  converge  to  a  solution. 

The  idea  is  to  start  with  an  initial  value  xo  (approximating  the  root),  and  use  linear  approximation  to 
create  values  xi,  X2,  •  •  •  getting  closer  and  closer  to  a  root. 

The  first  value  xi  corresponds  to  the  intercept  of  the  tangent  line  of  /(x o)  with  the  x-axis,  which  is: 


xi 


=  *o- 


/(*o) 

/'  Oo) 


Figure  5.10:  First  iteration  of  Newton’s  Method. 

We  can  see  in  Figure  5.10,  that  if  we  compare  the  point  (xo,0)  to  (xi,0),  we  would  likely  come  to 
the  conclusion  that  (xi,0)  is  closer  to  the  actual  root  of  /(x)  than  our  original  guess,  (xq,0).  As  will  be 
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discussed,  the  choice  of  xq  must  be  done  correctly,  and  it  may  occur  that  x\  does  not  yield  a  better  estimate 
of  the  root. 

Newton’s  method  is  simply  to  repeat  this  process  again  and  again  in  an  effort  to  obtain  a  more  accurate 
solution.  Thus  at  the  next  step  we  obtain: 


X2  =  Xl 


/(-h) 

/'(*  i) 


Figure  5.11:  Second  iteration  of  Newton’s  Method. 


We  can  now  clearly  see  how  (*2,0)  is  a  better  estimate  of  the  root  of  fix),  rather  than  any  of  the 
previous  points.  Moving  forward,  we  will  get: 


*3 


=  X2~ 


/(*  2) 

fix  2) 


Rest  assured,  (*3,0)  will  be  an  even  better  estimate  of  the  root!  We  express  the  general  iterative  step  as: 

_  f(xn) 

X„+1-X„ 


The  idea  is  to  iterate  these  steps  to  obtain  the  desired  accuracy.  Here  is  an  example. 


Example  5.32:  Newton  method  to  approximate  roots 


Use  Newton ’s  method  to  approximate  the  roots  of  f(x)  —  x3  —  x  +  1 . 


Solution.  You  can  try  to  find  solve  the  equation  algebraically  to  see  that  this  is  a  difficult  task,  and  thus  it 
make  sense  to  try  a  numerical  method  such  as  Newton’s. 

To  find  an  initial  value  xq,  note  that  /(— 1)  =  —  5  and  /( 0)  =  1,  and  by  the  Intermediate  Value  Theorem 
this  /  has  a  root  between  these  two  values,  and  we  decide  to  start  with  xq  —  —  1  (you  can  try  other  values 
to  see  what  happens). 
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Note  that  f'(x)  —  3x2  —  1,  and  thus  we  get 


f(xn)  ,  xl-Xn  +  1 

Xn+ 1  =X„-—^=X  +  n - - — 

/ \Xn)  3 X*  ~  1 

Thus  we  can  produce  the  following  values  (try  it): 

X0  =  -1 

xi  —  —1.5000 
X2  =  -1.347826.. 
jc3  =  -1.325200.. 
x4  =  -1.3247 18.. 
jc5  =  -1.324717.. 

*6  =  -1.324717.. 


and  we  can  now  approximate  the  root  as  —1.324717.  X 

As  with  any  numerical  method,  we  need  to  be  aware  of  the  weaknesses  of  any  technique  we  are  using. 


Figure  5.12:  Function  with  three  distinct  solutions. 


If  we  know  our  root  is  somewhere  near  a,  we  would  make  our  guess  xq  —  a.  Generally  speaking,  a 
good  practice  is  to  make  our  guess  as  close  to  the  actual  root  as  possible.  In  some  cases  we  may  have  no 
idea  where  the  root  is,  so  it  would  be  prudent  to  perform  the  algorithm  several  times  on  several  different 
initial  guesses  and  analyze  the  results. 

For  example  we  can  see  in  Figure  5.12  that  f(x)  in  fact  has  three  roots,  and  depending  on  our  initial 
guess,  we  may  get  the  algorithm  to  converge  to  different  roots.  If  we  did  not  know  where  the  roots  were, 
we  would  try  the  technique  several  times.  In  one  instance,  if  our  initial  guess  was  xa,  we’d  likely  converge 
to  (a,  0).  Then  if  we  were  to  choose  another  guess,  Xb,  then  we’d  likely  converge  to  (b,  0).  Eventually, 
using  various  initial  guesses  we’d  get  one  of  three  roots:  a,  b ,  or  c.  Under  these  circumstances  we  can 
clearly  see  the  effectiveness  of  this  numeric  method. 
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As  another  example  if  we  attempt  to  use  Newton's  Method  on  /(x)  —  sinx  using  xo  =  n/2,  then 
f'(x o)  =  0  so  xi  is  undefined  and  we  cannot  proceed.  Even  in  general  x„+i  is  typically  nowhere  near  xn, 
and  in  general  not  converging  to  the  root  nearest  to  our  initial  guess  of  xo-  In  effect,  the  algorithm  keeps 
"bouncing  around".  An  example  of  which  is  depicted  in  Figure  5.13.  Based  on  our  initial  guess  for  such 
a  function,  the  algorithm  may  or  may  not  converge  to  a  root,  or  it  may  or  may  not  converge  to  the  root 
closest  to  the  initial  guess.  This  gives  rise  to  the  more  common  issue:  Selection  of  the  initial  guess,  xo- 

Here  is  a  summary. 

Key  Points  in  using  Newton’s  method  to  approximate  a  root  of  /(x) 

1 .  Choosing  xo  as  close  as  possible  to  the  root  we  wish  to  find. 

2.  A  guess  for  xo  which  makes  the  algorithm  “bounce  around”  is  considered  unstable. 

3.  Even  the  smallest  changes  to  xo  can  have  drastic  effects:  We  may  converge  to  another  root,  we  may 
converge  very  slowly  (requiring  many  more  iterations),  or  we  may  encounter  an  unstable  point. 

4.  We  may  encounter  a  stationary  point  if  we  choose  xo  such  that  fix)  —  0  (; i.e .  at  a  critical  point!)  in 
which  case  the  algorithm  fails. 

This  is  all  to  say  that  your  initial  guess  for  xq  can  be  extremely  important. 


Exercises  for  5.4.4 


Exercise  5.4.13  Use  Newton’s  Method  to  find  all  roots  of  f(x)  —  3x2  —  9x  —  1 1.  (Hint:  use  Intermediate 
Value  Theorem  to  choose  an  appropriate  x$) 

Exercise  5.4.14  Consider  /(x)  =  x3  — x2  +x—  1. 
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(a)  Using  initial  approximation  xo  =  2,  find  X4. 

(b)  What  is  the  exact  value  of  the  wot  of  f?  How  does  this  compare  to  our  approximation  X4  in  part 
(ci)? 

(c)  What  would  happen  if  we  chose  xq  —  0  as  our  initial  approximation? 

Exercise  5.4.15  Consider  f(x)  —  sinx.  What  happens  when  we  choose  xq  =  k/2?  Explain. 


5.5  L’HopitaPs  Rule 


The  following  application  of  derivatives  allows  us  to  compute  certain  limits. 


Definition  5.33:  Limits  of  the  Indeterminate  Forms  R  and  - 

U  00 


A  limit  of  a  quotient  lim  is  said  to  be  an  indeterminate  form  of  the  type  if  both  f  (x)  — y  0 

and  g  (x)  — >■  0  as  x  — >•  a.  Likewise,  it  is  said  to  be  an  indeterminate  form  of  the  type  ^  if  both 
f  (x)  — >  ±o°  and  g  (x)  — >•  ±°°  as  x  — >  a  (Here,  the  two  ±  signs  are  independent  of  each  other). 


This  theorem  is  somewhat  difficult  to  prove,  in  part  because  it  incorporates  so  many  different  possibil¬ 
ities,  so  we  will  not  prove  it  here. 

We  should  also  note  that  there  may  be  instances  where  we  would  need  to  apply  L’Hopital’s  Rule 

fr  (x) 

multiple  times,  but  we  must  confirm  that  lim*  cfn-  is  still  indeterminate  before  we  attempt  to  apply 

8  W 

L’Hopital’s  Rule  again.  Finally,  we  want  to  mention  that  L’Hopital’s  rule  is  also  valid  for  one-sided  limits 
and  limits  at  infinity. 


Solution.  We  use  L’Hopital’s  Rule:  Since  the  numerator  and  denominator  both  approach  zero, 


x2  —  K2  2x 

lim  — : - =  lim - , 

x-^n  sinx  x^ncosx 
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provided  the  latter  exists.  But  in  fact  this  is  an  easy  limit,  since  the  denominator  now  approaches  —1,  so 

X2  —  K2  2  K 

lim - =  —  =  —2  k. 

x  ^  7i  sinx  —1 

X 


Solution.  As  x  goes  to  infinity,  both  the  numerator  and  denominator  go  to  infinity,  so  we  may  apply 
L’Hopital’s  Rule: 

, .  2x2  —  3x  +  7  4x  —  3 

lim  ^ - =  lim - . 

x^-oo  x 2  +  47x  +  1  2x  +  47 

In  the  second  quotient,  it  is  still  the  case  that  the  numerator  and  denominator  both  go  to  infinity,  so  we  are 
allowed  to  use  L’Hopital’s  Rule  again: 


lim  All  =  Hm  * 

jc->oo  2x  +  47  2 


2. 


So  the  original  limit  is  2  as  well. 


X 


Solution.  Both  the  numerator  and  denominator  approach  zero,  so  applying  L’Hopital’s  Rule: 


secx— 1  secxtanx  1-0 

lim - =  lim - = - =  0. 

x->o  sinx  jc-40  cosx  1 


X 

L’Hopital’s  rule  concerns  limits  of  a  quotient  that  are  indeterminate  forms.  But  not  all  functions  are 
given  in  the  form  of  a  quotient.  But  all  the  same,  nothing  prevents  us  from  re-writing  a  given  function 
in  the  form  of  a  quotient.  Indeed,  some  functions  whose  given  form  involve  either  a  product  /  (x)  g  (x) 
or  a  power  f(x)8^  carry  indeterminacies  such  as  0-  ±°°  and  1±0°.  Something  small  times  something 
numerically  large  (positive  or  negative)  could  be  anything.  It  depends  on  how  small  and  how  large  each 
piece  turns  out  to  be.  A  number  close  to  1  raised  to  a  numerically  large  (positive  or  negative)  power  could 
be  anything.  It  depends  on  how  close  to  1  the  base  is,  whether  the  base  is  larger  than  or  smaller  than  1, 
and  how  large  the  exponent  is  (and  its  sign).  We  can  use  suitable  algebraic  manipulations  to  relate  them  to 
indeterminate  quotients.  We  will  illustrate  with  two  examples,  first  a  product  and  then  a  power. 
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Solution.  This  doesn’t  appear  to  be  suitable  for  L’Hopital’s  Rule,  but  it  also  is  not  “obvious”.  As  x 
approaches  zero,  lnx  goes  to  —  °°,  so  the  product  looks  like: 

(something  very  small)  •  (something  very  large  and  negative). 

This  could  be  anything:  it  depends  on  how  small  and  how  large  each  piece  of  the  function  turns  out  to  be. 
As  defined  earlier,  this  is  a  type  of  ±“0  ■  °°”,  which  is  indeterminate.  So  we  can  in  fact  apply  L’Hopital’s 
Rule  after  re-writing  it  in  the  form  ^ : 

lnx  lnx 

xlnx  =  — —  =  — T. 

1/x  x  1 

Now  as  x  approaches  zero,  both  the  numerator  and  denominator  approach  infinity  (one  — and  one  +°°, 
but  only  the  size  is  important).  Using  L’Hopital’s  Rule: 

lnx  1/x  i. 

Inn  — T  —  lim  - =  lim  x  )  =  hm  — x  =  0. 

*^0+  X  1  jc->0+  —  X~z  x->0+  X  x-^0+ 

One  way  to  interpret  this  is  that  since  lim  xlnx  =  0,  the  x  approaches  zero  much  faster  than  the  lnx 

x-^0+ 

approaches  —  °°.  X 

Finally,  we  illustrate  how  a  limit  of  the  type  “1°°”  can  be  indeterminate. 


Example  5.39:  L’Hopital’s  Rule 


Evaluate  lim  xl^x 

x->l+ 


Solution  Plugging  in  x  =  1  (from  the  right)  gives  a  limit  of  the  type  “1°°”.  To  deal  with  this  type  of  limit 
we  will  use  logarithms.  Let 

L=  lim  x1/(x_1). 

x—tl+ 

Now,  take  the  natural  log  of  both  sides: 


Using  log  properties  we  have: 


lnL=  lim  In  \xx^x  . 

x->i+  V 

lnL  =  lim  -hL 

x->i+  x—  1 


The  right  side  limit  is  now  of  the  type  0/0,  therefore,  we  can  apply  L’Hopital’s  Rule: 


1„L  =  lim  =  lim  X-E  =  1 

x— ^1+  X — 1  x— »1+  1 


Thus,  InL  =  1  and  hence,  our  original  limit  (denoted  by  L)  is:  L  —  el  —  e.  That  is, 

L  =  lim  x1/(jc_1)  =  e. 

x— >1+ 


In  this  case,  even  though  our  limit  had  a  type  of  “1°°”,  it  actually  had  a  value  of  e. 
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Exercises  for  5.5 


Compute  the  following  limits. 

,,  .  _  _  „  cosx—  1 

Exercise  5.5.1  lim  — : - 


x—?o  smx 


e* 

Exercise  5.5.2  lim 

x— xi 


Exercise  5.5.3  lim  — 

X 


\yix 

Exercise  5.5.4  lim  — — 

\/x 


Exercise  5.5.5  lim 


\/9+x  —  3 


Exercise  5.5.6  lim 


x — ^0  X 

2  —  \J x  +  2 


x->2  4  —  X2 


[x—  1 

Exercise  5.5.7  lim 


*->1  1^/x—  1 

(1  — jc)1/4  —  1 

Exercise  5.5.8  lim  - - - - 

3C — X 


Exercise  5.5.9  lim(t  +  -  ]  ((4  —  t)3/2  —  8) 
t->o  V  t  / 


Exercise  5.5.10  lim  (-  +  -!=)  (\/F+T— 1) 
t^o+\t  y/ty 


Exercise  5.5.11  lim 


X-^-0  yj 2X  +7  —  1 

(m-1)3 


Exercise  5.5.12  lim  — ,  .  _ 

u-+\  (l/u)-u2  +  3/u-3 


Exercise  5.5.13  lim 


Exercise  5.5.14  lim 


2  +  (1/x) 

3  —  (2/x) 

1  +  5/  \/x 


x— >o+  2  +  1  / \fx 
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cosx 

Exercise  5.5.15  lim  - — — - 

x-*n/2  (7T/2)  —x 

ex-l 

Exercise  5.5.16  lim - 

X — ^0  X 


A 

Exercise  5.5.17  lim - - 

x->o  ex  —  x  —  1 

lnjt 

Exercise  5.5.18  lim - 

X—  1 


Exercise  5.5.19  lim 

x-^0 


ln(x2  + 1) 


Exercise  5.5.20  lim 


x — y  1  x  1 


Exercise  5.5.21  lim  ^  ^ 

ln(x  +  1) 

x1/4  -  1 

Exercise  5.5.22  lim - 

x — y  1  X 


\fx—  1 

Exercise  5.5.23  lim - 

x — V 1  X  1 


_  .  3x2  +  x  +  2 

Exercise  5.5.24  lim - 

-v— x  —  4 

Exercise  5.5.25  lim  ^ - 

vA  +  4-2 

Exercise  5.5.26  lim  ^ - 

X^0y/x+2-2 

Exercise  5.5.27  lim  ^  — — 

-V— >0+  VX  +1  —  1 

\/ x2  +1  —  1 

Exercise  5.5.28  lim  — .  - 

*->0  vX+l  —  1 


Exercise  5.5.29  lim  (x  +  5)  (  J-  H - - — 

x-+iK  J  \2x  x  +  2 


x3  —  6x  —  2 

Exercise  5.5.30  lim - = - - — 

x-S-2  Xi  +  4 


Exercise  5.5.31  Discuss  what  happens  if  we  try  to  use  L’Hdpital’s  rule  to  find  the  limit  lim 

X — ^oc 


x  +  sinx 


x-^°°  x  + 1 


5.6.  Curve  Sketching  199 


5.6  Curve  Sketching 


In  this  section,  we  discuss  how  we  can  tell  what  the  graph  of  a  function  looks  like  by  performing  simple 
tests  on  its  derivatives. 

5.6.1.  Intervals  of  Increase/Decrease,  and  the  First  Derivative  Test 


The  method  of  Section  5.2.1  for  deciding  whether  there  is  a  local  maximum  or  minimum  at  a  critical  value 
is  not  always  convenient.  We  can  instead  use  information  about  the  derivative  f'(x)  to  decide;  since  we 
have  already  had  to  compute  the  derivative  to  find  the  critical  values,  there  is  often  relatively  little  extra 
work  involved  in  this  method. 

How  can  the  derivative  tell  us  whether  there  is  a  maximum,  minimum,  or  neither  at  a  point?  Suppose 
that  /  is  differentiable  at  and  around  x  =  a,  and  suppose  further  that  a  is  a  critical  point  of  /.  Then  we 
have  several  possibilities: 

1.  There  is  a  local  maximum  at  x  —  a.  This  happens  if  f'(x)  >  0  as  we  approach  x  —  a  from  the  left 
(i.e.  when  x  is  in  the  vicinity  of  a,  and  x  <  a)  and  f'(x)  <  0  as  we  move  to  the  right  of  x  —  a  (i.e. 
when  x  is  in  the  vicinity  of  a,  and  x  >  a). 

2.  There  is  a  local  minimum  at  x  —  a.  This  happens  if  f (x)  <  0  as  we  approach  x  —  a  from  the  left 
(i.e.  when  x  is  in  the  vicinity  of  a,  and  x  <  a)  and  f'(x)  >  0  as  we  move  to  the  right  of  x  —  a  (i.e. 
when  x  is  in  the  vicinity  of  a,  and  x  >  a). 

3.  There  is  neither  a  local  maximum  or  local  minimum  at  x  —  a.  If  f{x)  does  not  change  from  negative 
to  positive,  or  from  positive  to  negative,  as  we  move  from  the  left  of  x  =  a  to  the  right  of  x  —  a  (that 
is,  f'(x )  is  positive  on  both  sides  of  x  =  a ,  or  negative  on  both  sides  of  x  —  a)  then  there  is  neither  a 
maximum  nor  minimum  when  x  =  a. 

See  the  first  graph  in  Figure  5.5  and  the  graph  in  Figure  5.6  for  examples. 


Example  5.40:  Local  Maximum  and  Minimum 


Find  all  local  maximum  and  minimum  points  for  f  (x)  =  sinx  +  cosx  using  the  first  derivative  test. 


Solution.  The  derivative  is  /'( x)  —  cosx  —  sinx  and  from  Example  5.10  the  critical  values  we  need  to 
consider  are  n/4  and  5tt/4. 

We  analyze  the  graphs  of  sinx  and  cosx.  Just  to  the  left  of  7t/4  the  cosine  is  larger  than  the  sine,  so 
f'{x)  is  positive;  just  to  the  right  the  cosine  is  smaller  than  the  sine,  so  f{x )  is  negative.  This  means  there 
is  a  local  maximum  at  k/4.  Just  to  the  left  of  5k /A  the  cosine  is  smaller  than  the  sine,  and  to  the  right  the 
cosine  is  larger  than  the  sine.  This  means  that  the  derivative  f\x )  is  negative  to  the  left  and  positive  to  the 
right,  so  /  has  a  local  minimum  at  5tt/4.  £ 

The  above  observations  have  obvious  intuitive  appeal  as  you  examine  the  graphs  in  Figures  5.5  and 
5.6.  We  can  extend  these  ideas  further  and  then  formulate  and  prove  a  theorem:  If  the  graph  of  /  is 
increasing  before  (i.e.,  to  the  left  of )  x  —  a  and  decreasing  after  (i.e.,  to  the  right  of)  x  =  a,  then  there  is 
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a  local  maximum  at  x  =  a.  If  the  graph  of  /  is  decreasing  before  x  —  a  and  increasing  after  x  —  a,  then 
there  is  a  local  minimum  at  x  —  a.  If  the  graph  of  /  is  consistently  increasing  on  either  side  of  x  —  a  or 
consistently  decreasing  on  either  side  of  x  —  a,  then  there  is  neither  a  local  maximum  nor  a  local  minimum 
at  x  —  a.  We  can  prove  the  following  theorem  using  the  Mean  Value  Theorem. 


Theorem  5.41:  Intervals  of  Increase  and  Decrease 


>  0  for  every  x  in  an  interval,  then  f  is  increasing  on  that  interval. 
<  0  for  every  x  in  an  interval,  then  f  is  decreasing  on  that  interval. 


Proof.  We  will  prove  the  increasing  case.  The  proof  of  the  decreasing  case  is  similar.  Suppose  that 
/  (x)  >  0  on  an  interval  I.  Then  /  is  differentiable,  and  hence  also,  continuous  on  I.  If  xi  and  X2  are  any 
two  numbers  in  I  and  xi  <  xj,  then  /  is  continuous  on  [jci , JC2]  and  differentiable  on  (xi,X2)  ■  By  the  Mean 
Value  Theorem,  there  is  some  c  in  (xi,X2)  such  that 

ft  /  N  =  /(*2)~/(*l) 

X2~X\ 

But  c  must  be  in  I,  and  thus,  since  /'  (x)  >  0  for  every  x  in  I,  f  (c)  >  0.  Also,  since  x\  <  x%,  we  have 
X2  —  xi  >  0.  Therefore,  both  the  left  hand  side  and  the  denominator  of  the  right  hand  side  are  positive. 
It  follows  that  the  numerator  of  the  right  hand  must  be  positive.  That  is,  /(x 2)  —  /(x  1)  >0,  or  in  other 
words,  /  (xi )  <  /  (X2) .  This  shows  that  between  x\  and  X2  in  7,  the  larger  one,  X2,  necessarily  has  the  larger 
function  value,  f(x 2) ,  and  the  smaller  one,  xi,  necessarily  have  the  smaller  function  value,  /(x  1) .  This 
means  that  /  is  increasing  on  I.  X 


Example  5.42:  Local  Minimum  and  Maximum 


Consider  the  function  f  (x)  —  x4  —  2x2.  Find  where  f  is  increasing  and  where  f  is  decreasing.  Use 
this  information  to  hnd  the  local  maximum  and  minimum  points  of  f. 


Solution.  We  compute  f  (x)  and  analyze  its  sign. 

f  (x)  =  4x3  —  4x  =  4x  (x2  —  l)  =  4x(x—  1)  (x+  1) . 

The  solution  of  the  inequality  /'  (x)  >  0  is  (— 1,0)  U  (1,°°).  So,  /  is  increasing  on  the  interval  (  —  1,0)  and 
on  the  interval  (1,°°) .  The  solution  of  the  inequality  /'  (x)  <  0  is  (— °o,  —  l)  U  (0, 1) .  So,  /  is  decreasing  on 
the  interval  (—  °°,  —  1)  and  on  the  interval  (0, 1) .  Therefore,  at  the  critical  points  —1,0  and  1,  respectively, 
/  has  a  local  minimum,  a  local  maximum  and  a  local  minimum.  X 

Exercises  for  5.6.1 


Find  all  critical  points  and  identify  them  as  local  maximum  points,  local  minimum  points,  or  neither. 

Exercise  5.6.1  y  —  x~  —  x 
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Exercise  5.6.2  y  =  2  +  3x  —  x3 
Exercise  5.6.3  y  —  x3  —  9x2  +  24x 
Exercise  5.6.4  y  —  x4  —  2x2  +  3 
Exercise  5.6.5  v  =  3x4  —  4x3 
Exercise  5.6.6  y=  (x2  —  l)/x 
Exercise  5.6.7  y  =  3x2  —  (1/x2) 

Exercise  5.6.8  y  —  cos(2x)  —  x 
Exercise  5.6.9  /(x)  =  (5  —  x)/(x  +  2) 

Exercise  5.6.10  /(x)  =  |x2  —  121 1 
Exercise  5.6.11  /(x)  =  x3/(x  +  1) 

Exercise  5.6.12  /(x)  =  sin2x 

Exercise  5.6.13  Find  the  maxima  and  minima  of  f(x)  —  secx. 

Exercise  5.6.14  Let  f(6)  —  cos 2(0)  —  2sin(0).  Find  the  intervals  where  f  is  increasing  and  the  intervals 
where  f  is  decreasing  in  [0,  27t]  .  Use  this  information  to  classify  the  critical  points  of  f  as  either  local 
maximums,  local  minimums,  or  neither. 


Exercise  5.6.15  Let  r  >  0.  Find  the  local  maxima  and  minima  of  the  function  f(x)  —  \/ r2  —  x2  on  its 
domain  \—r,r\. 

Exercise  5.6.16  Let  /(x)  =  ax2  +  hx  +  c  with  a  f  0.  Show  that  f  has  exactly  one  critical  point  using  the 
first  derivative  test.  Give  conditions  on  a  and  b  which  guarantee  that  the  critical  point  will  be  a  maximum. 
It  is  possible  to  see  this  without  using  calculus  at  all;  explain. 


5.6.2.  The  Second  Derivative  Test 


The  basis  of  the  first  derivative  test  is  that  if  the  derivative  changes  from  positive  to  negative  at  a  point  at 
which  the  derivative  is  zero  then  there  is  a  local  maximum  at  the  point,  and  similarly  for  a  local  minimum. 
If  f  changes  from  positive  to  negative  it  is  decreasing;  this  means  that  the  derivative  of  f,  f",  might  be 
negative,  and  if  in  fact  f"  is  negative  then  f  is  definitely  decreasing.  From  this  we  determine  that  there  is 
a  local  maximum  at  the  point  in  question.  Note  that  f  might  change  from  positive  to  negative  while  f" 
is  zero,  in  which  case  f"  gives  us  no  information  about  the  critical  value.  Similarly,  if  f  changes  from 
negative  to  positive  there  is  a  local  minimum  at  the  point,  and  f  is  increasing.  If  f"  >  0  at  the  point,  this 
tells  us  that  f  is  increasing,  and  so  there  is  a  local  minimum. 
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Example  5.43:  Second  Derivative 


Consider  again  /(x)  =  sinx  +  cosx,  with  f\x )  =  cosx  —  sinx  and  f"(x)  —  —  sinx  —  cosx.  Use  the 
second  derivative  test  to  determine  which  critical  points  are  local  maxima  or  minima. 


Solution.  Since  /"(k/A)  —  —y/l/l  —  y/l/l  —  —y/2  <  0,  we  know  there  is  a  local  maximum  at  Jt/A.  Since 
f"{5n/4)  =  — (— a/2/2)  —  (— ■ y/l/T)  —  Vl  >  0,  there  is  a  local  minimum  at  57T/4.  X 

When  it  works,  the  second  derivative  test  is  often  the  easiest  way  to  identify  local  maximum  and  min¬ 
imum  points.  Sometimes  the  test  fails,  and  sometimes  the  second  derivative  is  quite  difficult  to  evaluate; 
in  such  cases  we  must  fall  back  on  one  of  the  previous  tests. 


Example  5.44:  Second  Derivative 


Let  f(x)  —  x4  and  g(x)  —  — x4.  Classify  the  critical  points  of  /(x)  and  g(x)  as  either  maximum  or 
minimum. 


Solution.  The  derivatives  for  /(x)  are  f'(x)  —  4x3  and  f"(x)  —  12x2.  Zero  is  the  only  critical  value,  but 
f"(0)  —  0,  so  the  second  derivative  test  tells  us  nothing.  However,  /(x)  is  positive  everywhere  except  at 
zero,  so  clearly  /(x)  has  a  local  minimum  at  zero. 

On  the  other  hand,  for  g(x)  =  — x4,  g'(x)  —  — 4x3  and  g"(x)  —  —  12x2.  So  g(x)  also  has  zero  as  its  only 
only  critical  value,  and  the  second  derivative  is  again  zero,  but  — x4  has  a  local  maximum  at  zero.  X 


Exercises  for  5.6.2 


Find  all  local  maximum  and  minimum  points  by  the  second  derivative  test. 

Exercise  5.6.17  y  =  x2  —  x 

Exercise  5.6.18  y  —  2  +  3x  —  x3 
Exercise  5.6.19  y  =  x3  -  9x2  +  24x 
Exercise  5.6.20  y  —  x4  —  2x2  +  3 
Exercise  5.6.21  y  =  3x4  —  4x3 
Exercise  5.6.22  y  —  (x2  —  l)/x 
Exercise  5.6.23  y  =  3x2  —  (1/x2) 

Exercise  5.6.24  y  —  cos(2x)  —  x 
Exercise  5.6.25  y  —  4x+  y/l—x 
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Exercise  5.6.26 
Exercise  5.6.27 
Exercise  5.6.28 
Exercise  5.6.29 
Exercise  5.6.30 
Exercise  5.6.31 
Exercise  5.6.32 
Exercise  5.6.33 
Exercise  5.6.34 


y  —  (*  +  l)/\/  5x2  +  35 
y  —  x5  —x 
y  =  6*  +  sin  3* 
y  —  x+  l/x 
y  —  x2  +  1  jx 
y  —  (x  +  5)1/4 
y  =  tan2* 
y  =  cos2*  —  sin2* 
y  =  sin3  * 


5.6.3.  Concavity  and  Inflection  Points 


We  know  that  the  sign  of  the  derivative  tells  us  whether  a  function  is  increasing  or  decreasing;  for  example, 
when  /'(*)  >  0,  /(*)  is  increasing.  The  sign  of  the  second  derivative  fix)  tells  us  whether  f  is  increasing 
or  decreasing;  we  have  seen  that  if  f  is  zero  and  increasing  at  a  point  then  there  is  a  local  minimum  at  the 
point.  If  f  is  zero  and  decreasing  at  a  point  then  there  is  a  local  maximum  at  the  point.  Thus,  we  extracted 
information  about  /  from  information  about  f". 

We  can  get  information  from  the  sign  of  f"  even  when  f  is  not  zero.  Suppose  that  f"(a)  >  0.  This 
means  that  near  x  —  a,f  is  increasing.  If  f(a)  >  0,  this  means  that  /  slopes  up  and  is  getting  steeper; 
if  f\a )  <  0,  this  means  that  /  slopes  down  and  is  getting  less  steep.  The  two  situations  are  shown  in 
figure  5.14.  A  curve  that  is  shaped  like  this  is  called  concave  up. 


~T~ 

a 


~r 

a 


Figure  5.14:  f" (a)  >  0:  f' (a)  positive  and  increasing,  f  (a)  negative  and  increasing. 


Now  suppose  that  f"(a)  <  0.  This  means  that  near  x  —  a,  f  is  decreasing.  If  f(a)  >  0,  this  means 
that  /  slopes  up  and  is  getting  less  steep;  if  f'(a)  <  0,  this  means  that  /  slopes  down  and  is  getting  steeper. 
The  two  situations  are  shown  in  figure  5.15.  A  curve  that  is  shaped  like  this  is  called  concave  down. 
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Figure  5.15:  f"(a)  <  0:  /'(a)  positive  and  decreasing,  f(a )  negative  and  decreasing. 


If  we  are  trying  to  understand  the  shape  of  the  graph  of  a  function,  knowing  where  it  is  concave  up 
and  concave  down  helps  us  to  get  a  more  accurate  picture.  Of  particular  interest  are  points  at  which 
the  concavity  changes  from  up  to  down  or  down  to  up;  such  points  are  called  inflection  points.  If  the 
concavity  changes  from  up  to  down  at  x  —  a,  f"  changes  from  positive  to  the  left  of  a  to  negative  to  the 
right  of  a ,  and  usually  /"(a)  =  0.  We  can  identify  such  points  by  first  finding  where  f"(x)  is  zero  and  then 
checking  to  see  whether  f"(x)  does  in  fact  go  from  positive  to  negative  or  negative  to  positive  at  these 
points.  Note  that  it  is  possible  that  /"(a)  =  0  but  the  concavity  is  the  same  on  both  sides;  f(x)  —  x4  at 
x  =  0  is  an  example. 


Solution.  The  derivatives  are  f'(x)  —  3x2  —  1  and  f"(x)  =  6x.  Since  /"( 0)  =  0,  there  is  potentially  an 
inflection  point  at  zero.  Since  f"{x)  >  0  when  x  >  0  and  f"(x)  <  0  when  x  <  0  the  concavity  does  change 
from  concave  down  to  concave  up  at  zero,  and  the  curve  is  concave  down  for  all  x  <  0  and  concave  up  for 
all  x  >  0.  X 

Note  that  we  need  to  compute  and  analyze  the  second  derivative  to  understand  concavity,  so  we  may 
as  well  try  to  use  the  second  derivative  test  for  maxima  and  minima.  If  for  some  reason  this  fails  we  can 
then  try  one  of  the  other  tests. 


Exercises  for  5.6.3 


Describe  the  concavity  of  the  functions  below. 

Exercise  5.6.35  y  —  x2  —  x 

Exercise  5.6.36  y  —  2  +  3x  —  x3 
Exercise  5.6.37  y  —  x3  —  9x2  +  24x 
Exercise  5.6.38  y  —  x4  —  2x2  +  3 
Exercise  5.6.39  y  =  3x4  —  4x3 
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Exercise  5.6.40  y  —  (x2  —  l)/x 
Exercise  5.6.41  y  =  3x2  — (1/x2) 

Exercise  5.6.42  y  =  sinx  +  cosx 
Exercise  5.6.43  y  —  4x+  \J  1  —  x 
Exercise  5.6.44  y  —  (x  +  l)/\/5x2  +  35 
Exercise  5.6.45  y  —  x’  —  x 
Exercise  5.6.46  y  —  6x  +  sin3x 
Exercise  5.6.47  y  —  x  +  1/x 
Exercise  5.6.48  y  —  x2  +  1/x 
Exercise  5.6.49  y  —  (x  +  5)1/4 
Exercise  5.6.50  y  =  tan2x 
Exercise  5.6.51  y  —  cos2x—  sin2x 
Exercise  5.6.52  y  —  sin3x 

Exercise  5.6.53  Identify  the  intervals  on  which  the  graph  of  the  function  f(x)  —  x4  —  4x3  +  10  is  of  one  of 
these  four  shapes:  concave  up  and  increasing;  concave  up  and  decreasing;  concave  down  and  increasing; 
concave  down  and  decreasing. 

Exercise  5.6.54  Describe  the  concavity  of  y  —  x  +bx  +cx  +  d.  You  will  need  to  consider  different  cases, 
depending  on  the  values  of  the  coefficients. 

Exercise  5.6.55  Let  n  be  an  integer  greater  than  or  equal  to  two,  and  suppose  f  is  a  polynomial  of  degree 
n.  How  many  inflection  points  can  f  have?  Hint:  Use  the  second  derivative  test  and  the  fundamental 
theorem  of  algebra. 

5.6.4.  Asymptotes  and  Other  Things  to  Look  For 


A  vertical  asymptote  is  a  place  where  the  function  becomes  infinite,  typically  because  the  formula  for 
the  function  has  a  denominator  that  becomes  zero.  For  example,  the  reciprocal  function  /(x)  =  1/x  has 
a  vertical  asymptote  at  x  =  0,  and  the  function  tanx  has  a  vertical  asymptote  at  x  =  n/2  (and  also  at 
x  =  —  7t/2,  x  =  3tt/2,  etc.).  Whenever  the  formula  for  a  function  contains  a  denominator  it  is  worth 
looking  for  a  vertical  asymptote  by  checking  to  see  if  the  denominator  can  ever  be  zero,  and  then  checking 
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the  limit  at  such  points.  Note  that  there  is  not  always  a  vertical  asymptote  where  the  derivative  is  zero: 
/(x)  —  (sinx)  f x  has  a  zero  denominator  at  x  —  0,  but  since  lim(sinx)  jx  —  1  there  is  no  asymptote  there. 

x->0 


A  horizontal  asymptote  is  a  horizontal  line  to  which  /(x)  gets  closer  and  closer  as  x  approaches  °° 
(or  as  x  approaches  —  °°).  For  example,  the  reciprocal  function  has  the  x-axis  for  a  horizontal  asymptote. 
Horizontal  asymptotes  can  be  identified  by  computing  the  limits  lim  /(x)  and  lim  fix).  Since  lim  1/x  = 

x— >°°  x— >— °°  x— 


lim  1/x  =  0,  the  line  y  =  0  (that  is,  the  x-axis)  is  a  horizontal  asymptote  in  both  directions. 

X— >— OO 


Some  functions  have  asymptotes  that  are  neither  horizontal  nor  vertical,  but  some  other  line.  Such 
asymptotes  are  somewhat  more  difficult  to  identify  and  we  will  ignore  them. 


If  the  domain  of  the  function  does  not  extend  out  to  infinity,  we  should  also  ask  what  happens  as  x 
approaches  the  boundary  of  the  domain.  For  example,  the  function  y  —  /(x)  =  1/ \f  r-  —  x* 1 2 3 4 5 6 7  has  domain 
— r  <  x  <  r,  and  y  becomes  infinite  as  x  approaches  either  r  or  —  r.  In  this  case  we  might  also  identify  this 
behavior  because  when  x  —  ±r  the  denominator  of  the  function  is  zero. 


If  there  are  any  points  where  the  derivative  fails  to  exist  (a  cusp  or  corner),  then  we  should  take  special 
note  of  what  the  function  does  at  such  a  point. 

Finally,  it  is  worthwhile  to  notice  any  symmetry.  A  function  /(x)  that  has  the  same  value  for  — x  as 
for  x,  i.e.,  /(—  x)  =  /(x),  is  called  an  “even  function.”  Its  graph  is  symmetric  with  respect  to  the  y-axis. 
Some  examples  of  even  functions  are:  x"  when  n  is  an  even  number,  cosx,  and  sin2x.  On  the  other  hand, 
a  function  that  satisfies  the  property  /(—  x)  =  — /(x)  is  called  an  “odd  function.”  Its  graph  is  symmetric 
with  respect  to  the  origin.  Some  examples  of  odd  functions  are:  xn  when  n  is  an  odd  number,  sinx,  and 
tanx.  Of  course,  most  functions  are  neither  even  nor  odd,  and  do  not  have  any  particular  symmetry. 


5.6.5.  Summary  of  Curve  Sketching 


The  following  is  a  guideline  for  sketching  a  curve  y  =  /(x)  by  hand.  Each  item  may  not  be  relevant  to 
the  function  in  question,  but  utilizing  this  guideline  will  provide  all  information  needed  to  make  a  detailed 
sketch  of  the  function. 


Guideline  for  Curve  Sketching 


1 .  Domain  of  the  function 

2.  x-  and  y-Intercepts 

3.  Symmetry 

4.  Vertical  and  Horizontal  Asymptotes 

5.  Intervals  of  Increase/Decrease,  and  Local  Extrema 

6.  Concavity  and  Points  of  Inflection 

7.  Sketch  the  Graph 
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Solution. 

1.  The  domain  is  {x  :  x2  —  1  ^  0}  =  {x  :  x  ^  ±1}  =  (— °°,  — 1)  U  (  —  1, 1)  U  (1,°°) 

2.  There  is  an  x-intercept  at  x  —  0.  The  y  intercept  is  y  =  0. 

3.  /(— x)  =  fix),  so  /  is  an  even  function  (symmetric  about  y-axis) 

2x2  2 

4.  lim  — —  =  lim  -  ,  _  =  2,  so  y  =  2  is  a  horizontal  asymptote. 

x^±oo  x2  _  1  >±oo  1  -  1  /x2  " 

Now  the  denominator  is  0  at  x  =  ±1,  so  we  compute: 

2x2  .  2x2  .  2x2  .  2x2 

lim  — z - =  +oo,  hm  - =  — oo,  lim  — = - =  —  °°,  lim  - =  +°°. 

jr— >1+  X"  —  1  x—t  I "  Xz  —  1  x->-l+Xz—  1  jc-l-l-X2—  1 

So  the  lines  x  =  1  and  x  =  —  1  are  vertical  asymptotes. 

5.  For  critical  values  we  take  the  derivative: 

,  4x(x2  —  1)  —  2x2  ■  2x  — 4x 

Note  that  f'{x)  —  0  when  x  =  0  (the  top  is  zero).  Also,  f' (x)  =  ONE  when  x  =  ±1  (the  bottom  is 
zero).  As  x  =  ±1  is  not  in  the  domain  of  /(x),  the  only  critical  number  is  x  =  0  (recall  that  to  be  a 
critical  number  we  need  it  to  be  in  the  domain  of  the  original  function). 

Drawing  a  number  line  and  including  all  of  the  split  points  of  f'(x )  we  have: 

f  '(-2)  >  0  f '(-0.5)  >  0  f  '(0.5)  <  0  f '(2)  <  0 

+  +  -  - 

- 1 - 1 - 1 - 

-10  1 

inc  inc  dec  dec 


Thus  /  is  increasing  on  ( — °°,  —  1 )  U  ( — 1,0)  and  decreasing  on  (0, 1 )  U  ( 1 ,  °°) . 
By  the  first  derivative  test,  x  =  0  is  a  local  max. 


6.  For  possible  inflection  points  we  take  the  second  derivative: 


fix) 


12x2  +  4 

(x2  - 1)3 


The  top  is  never  zero.  Also,  the  bottom  is  only  zero  when  x  =  ±  1  (neither  of  which  are  in  the  domain 
of  /(x)).  Thus,  there  are  no  possible  inflection  points  to  consider. 
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Drawing  a  number  line  and  including  all  of  the  split  points  of  f"(x)  we  have: 

f"(-2)>0  f"(0)<0  f  "(2)  >  0 


-1  1 
CU  CD  CU 

Hence  /  is  concave  up  on  ( — °°,  —  1 )  U  ( 1 ,  °°) ,  concave  down  on  ( —  1 , 1 ) . 

7.  We  put  this  information  together  and  sketch  the  graph. 

We  combine  some  of  this  information  on  a  single  number  line  to  see  what  shape  the  graph  has  on 
certain  intervals: 

CU  VA  CD  max  CD  VA  CU 
- 1 - 1 - 

-1  0 

inc  inc  dec 

j  r 

Note  that  there  is  a  horizontal  asymptote  at  y  —  2  and  that  the 
y  —  0.  Therefore,  a  sketch  of  f(x)  is  as  follows: 


dec 


curve  has  jc-int  of  x  —  0  and  y-int  of 


X 


Exercises  for  5.6 


Sketch  the  curves.  Identify  clearly  any  interesting  features,  including  local  maximum  and  minimum  points, 
inflection  points,  asymptotes,  and  intercepts. 

Exercise  5.6.56  y  —  x5  —  5x4  +  5x3 


Exercise  5.6.57  y  —  x3  —  3x2  —  9x  +  5 


Exercise  5.6.58  y  —  (x  —  l)2(x  +  3)2/3 
Exercise  5.6.59  x2  +  x2  y2  =  cry2,  a  >  0. 
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Exercise  5.6.60  v  =  xex 
Exercise  5.6.61  y  =  (e*  +  e~x)/2 
Exercise  5.6.62  v  =  e~x cos x 
Exercise  5.6.63  y  —  ex  —  sinx 
Exercise  5.6.64  v  =  ev/x 
Exercise  5.6.65  v  =  4x  +  y/\  — x 
Exercise  5.6.66  y  =  (x+  1 )/ \/ 5x2  +  35 
Exercise  5.6.67  y  —  x5  —  x 
Exercise  5.6.68  y  =  6x  +  sin3x 
Exercise  5.6.69  y  =  x  +  1/x 
Exercise  5.6.70  y  —  x2  +  1/x 
Exercise  5.6.71  y  =  (x  +  5)1^4 
Exercise  5.6.72  y  =  tan2x 
Exercise  5.6.73  v  =  cos2  x  —  sin2  x 
Exercise  5.6.74  v  =  sin3x 
Exercise  5.6.75  v  =  x(x2  +  1) 

Exercise  5.6.76  y  =  x3  +  6x2  +  9x 
Exercise  5.6.77  y  —  x/  (x2  —  9) 

Exercise  5.6.78  y  —  x1  /(x2  +  9) 

Exercise  5.6.79  v  =  2^/x  —  x 

Exercise  5.6.80  y  —  3  sin(x)  —  sin3  (x),  for  x  G  [0, 2 n\ 

Exercise  5.6.81  y  =  (x—  l)/(x2) 
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5.7  Optimization  Problems 


Many  important  applied  problems  involve  finding  the  best  way  to  accomplish  some  task.  Often  this  in¬ 
volves  finding  the  maximum  or  minimum  value  of  some  function:  the  minimum  time  to  make  a  certain 
journey,  the  minimum  cost  for  doing  a  task,  the  maximum  power  that  can  be  generated  by  a  device,  and  so 
on.  Many  of  these  problems  can  be  solved  by  finding  the  appropriate  function  and  then  using  techniques 
of  calculus  to  find  the  maximum  or  the  minimum  value  required. 

Generally  such  a  problem  will  have  the  following  mathematical  form:  Find  the  largest  (or  smallest) 
value  of  /(x)  when  a  <x  <  b.  Sometimes  a  or  b  are  infinite,  but  frequently  the  real  world  imposes  some 
constraint  on  the  values  that  x  may  have. 

Such  a  problem  differs  in  two  ways  from  the  local  maximum  and  minimum  problems  we  encountered 
when  graphing  functions:  We  are  interested  only  in  the  function  between  a  and  b,  and  we  want  to  know 
the  largest  or  smallest  value  that  /(x)  takes  on,  not  merely  values  that  are  the  largest  or  smallest  in  a 
small  interval.  That  is,  we  seek  not  a  local  maximum  or  minimum  but  a  global  (or  absolute )  maximum  or 
minimum. 


Guidelines  to  solving  an  optimization  problem. 


1.  Understand  clearly  what  is  to  be  maximized  or  minimized  and  what  the  contraints  are. 

2.  Draw  a  diagram  (if  appropriate)  and  label  it. 

3.  Decide  what  the  variables  are.  For  example,  A  for  area,  r  for  radius,  C  for  cost. 

4.  Write  a  formula  for  the  function  for  which  you  wish  to  find  the  maximum  or  minimum. 

5.  Express  that  formula  in  terms  of  only  one  variable,  that  is,  in  the  form  /(x).  Usually  this  is 
accomplished  by  using  the  given  constraints. 

6.  Set  f'(x)  =  0  and  solve.  Check  all  critical  values  and  endpoints  to  determine  the  extreme 
value(s)  of  /(x). 


Example  5.47:  Largest  Rectangle 


Find  the  largest  rectangle  (that  is,  the  rectangle  with  largest  area)  that  fits  inside  the  graph  of  the 
parabola  y  —  x2  below  the  line  y  —  a  (a  is  an  unspecified  constant  value),  with  the  top  side  of  the 
rectangle  on  the  horizontal  line  y  —  a;  see  Figure  5.16.) 


Solution.  We  want  to  find  the  maximum  value  of  some  function  A(x)  representing  area.  Perhaps  the 
hardest  part  of  this  problem  is  deciding  what  x  should  represent.  The  lower  right  corner  of  the  rectangle  is 
at  (x,x2),  and  once  this  is  chosen  the  rectangle  is  completely  determined.  So  we  can  let  the  x  in  A(x)  be 
the  x  of  the  parabola  /(x)  —  x2.  Then  the  area  is 

A(x)  —  (2x)(a  —  x2)  —  — 2x3  +  2  ax. 
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We  want  the  maximum  value  of  A(x)  when  x  is  in  [0,  \/a] .  (You  might  object  to  allowing  x  —  0  or  x  —  \fci , 
since  then  the  “rectangle”  has  either  no  width  or  no  height,  so  is  not  “really”  a  rectangle.  But  the  problem 
is  somewhat  easier  if  we  simply  allow  such  rectangles,  which  have  zero  area.) 

Setting  0  =  A' (x)  —  6x2  +  2 a  we  get  x  =  \Ja/3  as  the  only  critical  value.  Testing  this  and  the  two 
endpoints,  we  have  A(0)  =  A(y/a)  —  0  and  A(\/a/ 3)  =  (4/9)  v^a3/2.  The  maximum  area  thus  occurs 
when  the  rectangle  has  dimensions  2^/a/3  x  (2/3 )a.  4 


Example  5.48:  Largest  Cone 


If  you  fit  the  largest  possible  cone  inside  a  sphere,  what  fraction  of  the  volume  of  the  sphere  is 
occupied  by  the  cone?  (Here  by  “cone”  we  mean  a  right  circular  cone,  i.e.,  a  cone  for  which  the 
base  is  perpendicular  to  the  axis  of  symmetry,  and  for  which  the  cross-section  cut  perpendicular  to 
the  axis  of  symmetry  at  any  point  is  a  circle.) 


Solution.  Let  R  be  the  radius  of  the  sphere,  and  let  r  and  h  be  the  base  radius  and  height  of  the  cone  inside 
the  sphere.  What  we  want  to  maximize  is  the  volume  of  the  cone:  nr2h/ 3.  Here  R  is  a  fixed  value,  but  r 
and  h  can  vary.  Namely,  we  could  choose  r  to  be  as  large  as  possible — equal  to  R — by  taking  the  height 
equal  to  R;  or  we  could  make  the  cone’s  height  h  larger  at  the  expense  of  making  r  a  little  less  than  R.  See 
the  cross-section  depicted  in  Figure  5.17.  We  have  situated  the  picture  in  a  convenient  way  relative  to  the 
x  and  y  axes,  namely,  with  the  center  of  the  sphere  at  the  origin  and  the  vertex  of  the  cone  at  the  far  left  on 
the  x-axis. 

Notice  that  the  function  we  want  to  maximize,  nr2h/ 3,  depends  on  two  variables.  This  is  frequently 
the  case,  but  often  the  two  variables  are  related  in  some  way  so  that  “really”  there  is  only  one  variable.  So 
our  next  step  is  to  find  the  relationship  and  use  it  to  solve  for  one  of  the  variables  in  terms  of  the  other,  so 
as  to  have  a  function  of  only  one  variable  to  maximize.  In  this  problem,  the  condition  is  apparent  in  the 
figure:  the  upper  corner  of  the  triangle,  whose  coordinates  are  ( h  —  R ,  r ),  must  be  on  the  circle  of  radius  R. 
That  is, 

(h-R)2  +  r2  =R2. 

We  can  solve  for  h  in  terms  of  r  or  for  r  in  terms  of  h.  Either  involves  taking  a  square  root,  but  we  notice  that 

r\  nt  ry  ry  ry 

the  volume  function  contains  r  ,  not  r  by  itself,  so  it  is  easiest  to  solve  for  r  directly:  r  —  R~  —  (h  —  R). 
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Then  we  substitute  the  result  into  nr2h/3: 

V{h)  =  n(R2-(h-R)2)h/3 
—  —  |  h3  +  \nh2R 


We  want  to  maximize  V(h)  when  h  is  between  0  and  2 R.  Now  we  solve  0  =  f'(h)  —  —Kh2  +  (4/3)7 zhR, 
getting  h  —  0  or  h  =  4R/3.  We  compute  V(0)  =  V(2 R)  —  0  and  V(4R/3)  —  (32/81)7 xR3 .  The  maximum 
is  the  latter;  since  the  volume  of  the  sphere  is  (4/3)7 zR3,  the  fraction  of  the  sphere  occupied  by  the  cone  is 


(32/81)7 zR3 
(4/3)nR3 


8 

27 


«  30%. 


X 


Example  5.49:  Containers  of  Given  Volume 


You  are  making  cylindrical  containers  to  contain  a  given  volume.  Suppose  that  the  top  and  bottom 
are  made  of  a  material  that  is  N  times  as  expensive  (cost  per  unit  area)  as  the  material  used  for  the 
lateral  side  of  the  cylinder. 

Find  (in  terms  of  N)  the  ratio  of  height  to  base  radius  of  the  cylinder  that  minimizes  the  cost  of 
making  the  containers. 


Solution.  Let  us  first  choose  letters  to  represent  various  things:  h  for  the  height,  r  for  the  base  radius,  V 
for  the  volume  of  the  cylinder,  and  c  for  the  cost  per  unit  area  of  the  lateral  side  of  the  cylinder;  V  and  c 
are  constants,  h  and  r  are  variables.  Now  we  can  write  the  cost  of  materials: 


c(2nrh)  +  Nc(2nr2). 

Again  we  have  two  variables;  the  relationship  is  provided  by  the  fixed  volume  of  the  cylinder:  V  —  nr  h. 
We  use  this  relationship  to  eliminate  h  (we  could  eliminate  r,  but  it’s  a  little  easier  if  we  eliminate  h,  which 
appears  in  only  one  place  in  the  above  formula  for  cost).  The  result  is 


V  o  2cV  n 

f(r)  —  2cnr — y  +  2Ncnr~  = - b  2Ncnrz. 


nr- 


r 
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We  want  to  know  the  minimum  value  of  this  function  when  r  is  in  (0,°°).  We  now  set  0  =  f'(r)  = 
—2  cV /r2  +  AN  cur,  giving  r  —  {/V  /  (2Nn).  Since  f"(r )  =  A  cV /r3  +  ANcn  is  positive  when  r  is  positive, 
there  is  a  local  minimum  at  the  critical  value,  and  hence  a  global  minimum  since  there  is  only  one  critical 
value. 

Finally,  since  h  =  V / {nr2), 

h  =  Y  = _ Y. _ =  2n, 

r  nr 3  n(V/(2Nn)) 

so  the  minimum  cost  occurs  when  the  height  h  is  IN  times  the  radius.  If,  for  example,  there  is  no  difference 
in  the  cost  of  materials,  the  height  is  twice  the  radius  (or  the  height  is  equal  to  the  diameter).  £ 


Example  5.50:  Rectangles  of  Given  Area 


Of  all  rectangles  of  area  100,  which  has  the  smallest  perimeter? 


Solution.  First  we  must  translate  this  into  a  purely  mathematical  problem  in  which  we  want  to  find  the 
minimum  value  of  a  function.  If  x  denotes  one  of  the  sides  of  the  rectangle,  then  the  adjacent  side  must  be 
100/x  (in  order  that  the  area  be  100).  So  the  function  we  want  to  minimize  is 


fix)  =  2x  +  2 


100 

x 


since  the  perimeter  is  twice  the  length  plus  twice  the  width  of  the  rectangle.  Not  all  values  of  x  make  sense 
in  this  problem:  lengths  of  sides  of  rectangles  must  be  positive,  so  x  >  0.  If  x  >  0  then  so  is  100/x,  so  we 
need  no  second  condition  on  x. 

We  next  find  f'{x)  and  set  it  equal  to  zero:  0  =  f (x)  —2  —  200/x2.  Solving  f (x)  =  0  for  x  gives  us 
x  =  ±10.  We  are  interested  only  in  x  >  0,  so  only  the  value  x  =  10  is  of  interest.  Since  f'{x )  is  defined 
everywhere  on  the  interval  (0,°°),  there  are  no  more  critical  values,  and  there  are  no  endpoints.  Is  there  a 
local  maximum,  minimum,  or  neither  atx  =  10?  The  second  derivative  is  f"  (x)  =  400/x3,  and/"  (10)  >  0, 
so  there  is  a  local  minimum.  Since  there  is  only  one  critical  value,  this  is  also  the  global  minimum,  so  the 
rectangle  with  smallest  perimeter  is  the  10  x  10  square.  X 


Example  5.51:  Maximize  your  Profit 


You  want  to  sell  a  certain  number  n  of  items  in  order  to  maximize  your  profit.  Market  research  tells 
you  that  if  you  set  the  price  at  $1.50,  you  will  be  able  to  sell  5000  items,  and  for  every  1 0  cents  you 
lower  the  price  below  $1.50  you  will  be  able  to  sell  another  1000  items.  Suppose  that  your  fixed 
costs  (“start-up  costs”)  total  $2000,  and  the  per  item  cost  of  production  (“marginal  cost”)  is  $0.50. 
Find  the  price  to  set  per  item  and  the  number  of  items  sold  in  order  to  maximize  profit,  and  also 
determine  the  maximum  profit  you  can  get. 


Solution.  The  first  step  is  to  convert  the  problem  into  a  function  maximization  problem.  Since  we  want 
to  maximize  profit  by  setting  the  price  per  item,  we  should  look  for  a  function  P(x)  representing  the  profit 
when  the  price  per  item  is  x.  Profit  is  revenue  minus  costs,  and  revenue  is  number  of  items  sold  times 
the  price  per  item,  so  we  get  P  —  nx  —  2000  —  0.50/l  The  number  of  items  sold  is  itself  a  function  of  x, 
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n  —  5000+  1000(1.5  —  x)/0.10,  because  (1.5  —  x)/0.10  is  the  number  of  multiples  of  10  cents  that  the 
price  is  below  $1.50.  Now  we  substitute  for  n  in  the  profit  function: 

P{x)  =  (5000  +  1000(  1 .5  -  x)/0. 10)x  -  2000  -  0.5  (5000  +  1000(  1 .5  -  x)  /0. 10) 

=  —  lOOOOx2  +  25000.x  -  12000 


We  want  to  know  the  maximum  value  of  this  function  when  x  is  between  0  and  1.5.  The  derivative  is 
P'(x)  —  — 20000x  +  25000,  which  is  zero  when  x  =  1.25.  Since  P"(x)  —  —20000  <  0,  there  must  be  a 
local  maximum  at  x  =  1.25,  and  since  this  is  the  only  critical  value  it  must  be  a  global  maximum  as  well. 
(Alternately,  we  could  compute  P(0)  =  —12000,  P(1.25)  =  3625,  and  P(  1.5)  =  3000  and  note  thatP(1.25) 
is  the  maximum  of  these.)  Thus  the  maximum  profit  is  $3625,  attained  when  we  set  the  price  at  $1.25  and 
sell  7500  items.  X 


Example  5.52:  Minimize  Travel  Time 


Suppose  you  want  to  reach  a  point  A  that  is  located  across  the  sand  from  a  nearby  road  (see  Fig¬ 
ure  5.18).  Suppose  that  the  road  is  straight,  and  b  is  the  distance  from  A  to  the  closest  point  C  on 
the  road.  Let  v  be  your  speed  on  the  road,  and  let  w,  which  is  less  than  v,  be  your  speed  on  the  sand. 
Right  now  you  are  at  the  point  D,  which  is  a  distance  a  from  C.  At  what  point  B  should  you  turn  off 
the  road  and  head  across  the  sand  in  order  to  minimize  your  travel  time  to  A? 


Solution.  Let  x  be  the  distance  short  of  C  where  you  turn  off,  i.e.,  the  distance  from  B  to  C.  We  want  to 
minimize  the  total  travel  time.  Recall  that  when  traveling  at  constant  velocity,  time  is  distance  divided  by 
velocity. 

You  travel  the  distance  DB  at  speed  v,  and  then  the  distance  BA  at  speed  w.  Since  DB  —  a  —  x  and,  by 
the  Pythagorean  theorem,  BA  =  V x2  +b2,  the  total  time  for  the  trip  is 


/(-*) 


a  —  x 
- h 

V 


Vx2  +  b2 
w 


We  want  to  find  the  minimum  value  of  /  when  x  is  between  0  and  a.  As  usual  we  set  f(x)  =  0  and  solve 
for  x: 


0  =  f'(x) 


1  x 

v  w\J x2  +  b2 

tv\/ x2  +  b2  —  vx 
w2(x2  +  b2)  =  v2x2 
w2b2  =  (v2  —  w2)x2 
wb 

\J  v 2  —  w2 


Notice  that  a  does  not  appear  in  the  last  expression,  but  a  is  not  irrelevant,  since  we  are  interested  only  in 
critical  values  that  are  in  [0,a],  and  wb/  y/v2  —  w2  is  either  in  this  interval  or  not.  If  it  is,  we  can  use  the 
second  derivative  to  test  it: 


(x2  +  £>2)3/2vr 
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Since  this  is  always  positive  there  is  a  local  minimum  at  the  critical  point,  and  so  it  is  a  global  minimum 
as  well. 

If  the  critical  value  is  not  in  [0,a]  it  is  larger  than  a.  In  this  case  the  minimum  must  occur  at  one  of  the 
endpoints.  We  can  compute 


m 

f(a) 


a  b 
—  — I — 
v  w 

V  a2  +  b2 
w 


but  it  is  difficult  to  determine  which  of  these  is  smaller  by  direct  comparison.  If,  as  is  likely  in  practice, 
we  know  the  values  of  v,  w,  a,  and  b,  then  it  is  easy  to  determine  this.  With  a  little  cleverness,  however,  we 
can  determine  the  minimum  in  general.  We  have  seen  that  f"(x)  is  always  positive,  so  the  derivative  f'{x) 
is  always  increasing.  We  know  that  at  wb/  \J  v2  —  w2  the  derivative  is  zero,  so  for  values  of  x  less  than  that 
critical  value,  the  derivative  is  negative.  This  means  that  /( 0)  >  /(a),  so  the  minimum  occurs  when  x  —  a. 

So  the  upshot  is  this:  If  you  start  farther  away  from  C  than  wb/  a/ y2  —  w2  then  you  always  want  to  cut 
across  the  sand  when  you  are  a  distance  wb/  \J v2  —  w2  from  point  C.  If  you  start  closer  than  this  to  C,  you 
should  cut  directly  across  the  sand.  X 


A 


Figure  5.18:  Minimizing  travel  time. 


Exercises  for  Section  5.7 


Exercise  5.7.1  Find  the  dimensions  of  the  rectangle  of  largest  area  having  fixed  perimeter  100. 

Exercise  5.7.2  Find  the  dimensions  of  the  rectangle  of  largest  area  having  fixed  perimeter  P. 

Exercise  5.7.3  A  box  with  square  base  and  no  top  is  to  hold  a  volume  100.  Find  the  dimensions  of  the  box 
that  requires  the  least  material  for  the  five  sides.  Also  find  the  ratio  of  height  to  side  of  the  base. 

Exercise  5.7.4  A  box  with  square  base  is  to  hold  a  volume  200.  The  bottom  and  top  are  formed  by 
folding  in  flaps  from  all  four  sides,  so  that  the  bottom  and  top  consist  of  two  layers  of  cardboard.  Find  the 
dimensions  of  the  box  that  requires  the  least  material.  Also  find  the  ratio  of  height  to  side  of  the  base. 
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Exercise  5.7.5  A  box  with  square  base  and  no  top  is  to  hold  a  volume  V.  Find  ( in  terms  of  V)  the 
dimensions  of  the  box  that  requires  the  least  material  for  the  five  sides.  Also  find  the  ratio  of  height  to  side 
of  the  base.  (This  ratio  will  not  involve  V.) 

Exercise  5.7.6  You  have  100  feet  offence  to  make  a  rectangular  play  area  alongside  the  wall  of  your 
house.  The  wall  of  the  house  bounds  one  side.  What  is  the  largest  size  possible  ( in  square  feet)  for  the  play 
area  ? 

Exercise  5.7.7  You  have  l  feet  offence  to  make  a  rectangular  play  area  cdongside  the  wcdl  of  your  house. 
The  wcdl  of  the  house  bounds  one  side.  What  is  the  largest  size  possible  ( in  square  feet)  for  the  play  area  ? 

Exercise  5.7.8  Marketing  tells  you  that  if  you  set  the  price  of  an  item  at  $10  then  you  will  be  unable  to 
sell  it,  but  that  you  can  sell  500  items  for  each  dollar  below  $10  that  you  set  the  price.  Suppose  your  fixed 
costs  total  $3000,  and  your  margined  cost  is  $2  per  item.  What  is  the  most  profit  you  can  make? 

Exercise  5.7.9  Find  the  area  of  the  largest  rectangle  that  fits  inside  a  semicircle  of  radius  10  (one  side  of 
the  rectangle  is  along  the  diameter  of  the  semicircle). 

Exercise  5.7.10  Find  the  area  of  the  largest  rectangle  that  fits  inside  a  semicircle  of  radius  r  (one  side  of 
the  rectangle  is  cdong  the  diameter  of  the  semicircle). 

Exercise  5.7.11  For  a  cylinder  with  surface  area  50,  including  the  top  and  the  bottom,  find  the  ratio  of 
height  to  base  radius  that  maximizes  the  volume. 

Exercise  5.7.12  For  a  cylinder  with  given  surface  area  S,  including  the  top  and  the  bottom,  find  the  ratio 
of  height  to  base  radius  that  maximizes  the  volume. 

Exercise  5.7.13  You  want  to  make  cylindrical  containers  to  hold  1  liter  using  the  least  amount  of  con¬ 
struction  material.  The  side  is  made  from  a  rectangular  piece  of  material,  and  this  can  be  done  with  no 
material  wasted.  However,  the  top  and  bottom  are  cut  from  squares  of  side  2 r,  so  that  2(2 r)2  =  8r2  of 
material  is  needed  (rather  than  2nr,  which  is  the  total  area  of  the  top  and  bottom).  Find  the  dimensions 
of  the  container  using  the  least  amount  of  material,  and  also  find  the  ratio  of  height  to  radius  for  this 
container. 

Exercise  5.7.14  You  want  to  make  cylindrical  containers  of  a  given  volume  V  using  the  least  amount  of 
construction  material.  The  side  is  made  from  a  rectangular  piece  of  material,  and  this  can  be  done  with 
no  material  wasted.  However,  the  top  and  bottom  are  cut  from  squares  of  side  2  r,  so  that  2(2  r)2  =  8r2  of 
material  is  needed  (rather  than  2k  r  ,  which  is  the  toted  area  of  the  top  and  bottom).  Find  the  optimal  ratio 
of  height  to  radius. 

Exercise  5.7.15  Given  a  right  circular  cone,  you  put  an  upside-down  cone  inside  it  so  that  its  vertex  is 
at  the  center  of  the  base  of  the  larger  cone  and  its  base  is  parallel  to  the  base  of  the  larger  cone.  If  you 
choose  the  upside-down  cone  to  have  the  largest  possible  volume,  what  fraction  of  the  volume  of  the  larger 
cone  does  it  occupy?  (Let  H  and  R  be  the  height  and  base  radius  of  the  larger  cone,  and  let  h  and  r  be  the 
height  and  base  radius  of  the  smaller  cone.  Hint:  Use  similar  triangles  to  get  an  equation  relating  h  and 
r.) 
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Exercise  5.7.16  A  container  holding  a  fixed  volume  is  being  made  in  the  shape  of  a  cylinder  with  a 
hemispherical  top.  (The  hemispherical  top  has  the  same  radius  as  the  cylinder.)  Find  the  ratio  of  height 
to  radius  of  the  cylinder  which  minimizes  the  cost  of  the  container  if  (a)  the  cost  per  unit  area  of  the  top  is 
twice  as  great  as  the  cost  per  unit  area  of  the  side,  and  the  container  is  made  with  no  bottom;  (b)  the  same 
as  in  (a),  except  that  the  container  is  made  with  a  circular  bottom,  for  which  the  cost  per  unit  area  is  1.5 
times  the  cost  per  unit  area  of  the  side. 

Exercise  5.7.17  A  piece  of  cardboard  is  1  meter  by  1/2  meter.  A  square  is  to  be  cut  from  each  corner  and 
the  sides  folded  up  to  make  an  open-top  box.  What  are  the  dimensions  of  the  box  with  maximum  possible 
volume  ? 

Exercise  5.7.18  (a)  A  square  piece  of  cardboard  of  side  a  is  used  to  make  an  open-top  box  by  cutting  out 
a  small  square  from  each  corner  and  bending  up  the  sides.  How  large  a  square  should  be  cut  from  each 
corner  in  order  that  the  box  have  maximum  volume?  ( b)  What  if  the  piece  of  cardboard  used  to  make  the 
box  is  a  rectangle  of  sides  a  and  b? 

Exercise  5.7.19  A  window  consists  of  a  rectangular  piece  of  clear  glass  with  a  semicircular  piece  of 
colored  glass  on  top;  the  colored  glass  transmits  only  1/2  as  much  light  per  unit  area  as  the  the  clear 
glass.  If  the  distance  from  top  to  bottom  (across  both  the  rectangle  and  the  semicircle)  is  2  meters  and 
the  window  may  be  no  more  than  1.5  meters  wide,  find  the  dimensions  of  the  rectangular  portion  of  the 
window  that  lets  through  the  most  light. 

Exercise  5.7.20  A  window  consists  of  a  rectangular  piece  of  clear  glass  with  a  semicircular  piece  of 
colored  glass  on  top.  Suppose  that  the  colored  glass  transmits  only  k  times  as  much  light  per  unit  area  as 
the  clear  glass  (k  is  between  0  and  1 ).  If  the  distance  from  top  to  bottom  (across  both  the  rectangle  and 
the  semicircle)  is  a  fixed  distance  H,  find  (in  terms  ofk)  the  ratio  of  vertical  side  to  horizontal  side  of  the 
rectangle  for  which  the  window  lets  through  the  most  light. 

Exercise  5.7.21  You  are  designing  a  poster  to  contain  a  fixed  amount  A  of  printing  (measured  in  square 
centimeters)  and  have  margins  of  a  centimeters  at  the  top  and  bottom  and  b  centimeters  at  the  sides.  Find 
the  ratio  of  vertical  dimension  to  horizontal  dimension  of  the  printed  area  on  the  poster  if  you  want  to 
minimize  the  amount  of  posterboard  needed. 

Exercise  5.7.22  What  fraction  of  the  volume  of  a  sphere  is  taken  up  by  the  largest  cylinder  that  can  befit 
inside  the  sphere? 

Exercise  5.7.23  The  U.S.  post  office  will  accept  a  box  for  shipment  only  if  the  sum  of  the  length  and  girth 
(distance  around)  is  at  most  108  in.  Find  the  dimensions  of  the  largest  acceptable  box  with  square  front 
and  back. 

Exercise  5.7.24  Find  the  dimensions  of  the  lightest  cylindrical  can  containing  0.25  liter  ( =250  cm3)  if  the 
top  and  bottom  are  made  of  a  material  that  is  twice  as  heavy  (per  unit  area)  as  the  material  used  for  the 
side. 

Exercise  5.7.25  A  conical  paper  cup  is  to  hold  1/4  of  a  liter.  Find  the  height  and  radius  of  the  cone  which 
minimizes  the  amount  of  paper  needed  to  make  the  cup.  Use  the  formula  nr  \J  r1  +  h2  for  the  area  of  the 
side  of  a  cone. 
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Exercise  5.7.26  A  conical  paper  cup  is  to  hold  a  fixed  volume  of  water.  Find  the  ratio  of  height  to 
base  radius  of  the  cone  which  minimizes  the  amount  of  paper  needed  to  make  the  cup.  Use  the  formula 
1 Try/ r2  +h2  for  the  area  of  the  side  of  a  cone,  called  the  lateral  area  of  the  cone. 

Exercise  5.7.27  Find  the  fraction  of  the  area  of  a  triangle  that  is  occupied  by  the  largest  rectangle  that 
can  be  drawn  in  the  triangle  (with  one  of  its  sides  along  a  side  of  the  triangle).  Show  that  this  fraction 
does  not  depend  on  the  dimensions  of  the  given  triangle. 

Exercise  5.7.28  Flow  are  your  answers  to  Problem  5.7.8  affected  if  the  cost  per  item  for  the  x  items, 
instead  of  being  simply  $2,  decreases  below  $2  in  proportion  to  x  (because  of  economy  of  scale  and 
volume  discounts)  by  1  cent  for  each  25  items  produced? 


6.  Integration 


6.1  Displacement  and  Area 


Example  6.1:  Object  Moving  in  a  Straight  Line 


An  object  moves  in  a  straight  line  so  that  its  speed  at  time  t  is  given  by  v(t  )  =  3/  in,  say,  cm/sec.  If 
the  object  is  at  position  10  on  the  straight  line  when  t  —  0,  where  is  the  object  at  any  time  t  ? 


Solution.  There  are  two  reasonable  ways  to  approach  this  problem.  If  s(t)  is  the  position  of  the  object 
at  time  t,  we  know  that  s'(t )  =  v(t).  Based  on  our  knowledge  of  derivatives,  we  therefore  know  that 
s(t)  —  3t2 /2-\-k,  and  because  ^(O)  =  lOwe  easily  discover  that  k  =  10,  so  s(t)  =  3t2/2+10.  For  example, 
at  t  —  1  the  object  is  at  position  3/2  +  10  =  1 1.5.  This  is  certainly  the  easiest  way  to  deal  with  this  problem. 
Not  all  similar  problems  are  so  easy,  as  we  will  see;  the  second  approach  to  the  problem  is  more  difficult 
but  also  more  general. 

We  start  by  considering  how  we  might  approximate  a  solution.  We  know  that  at  t  =  0  the  object  is  at 
position  10.  How  might  we  approximate  its  position  at,  say,  1  =  1?  We  know  that  the  speed  of  the  object  at 
time  t  —  0  is  0;  if  its  speed  were  constant  then  in  the  first  second  the  object  would  not  move  and  its  position 
would  still  be  10  when  t  =  1.  In  fact,  the  object  will  not  be  too  far  from  10  at  t  —  1,  but  certainly  we  can 
do  better.  Let’s  look  at  the  times  0.1,  0.2,  0.3,  . . . ,  1.0,  and  try  approximating  the  location  of  the  object  at 
each,  by  supposing  that  during  each  tenth  of  a  second  the  object  is  going  at  a  constant  speed.  Since  the 
object  initially  has  speed  0,  we  again  suppose  it  maintains  this  speed,  but  only  for  a  tenth  of  second;  during 
that  time  the  object  would  not  move.  During  the  tenth  of  a  second  from  t  =  0. 1  to  t  —  0.2,  we  suppose  that 
the  object  is  traveling  at  0.3  cm/sec,  namely,  its  actual  speed  at  t  —  0.1.  In  this  case  the  object  would  travel 
(0.3) (0. 1)  =  0.03  centimeters:  0.3  cm/sec  times  0.1  seconds.  Similarly,  between  t  =  0.2  and  t  —  0.3  the 
object  would  travel  (0.6)  (0.1)  =  0.06  centimeters.  Continuing,  we  get  as  an  approximation  that  the  object 
travels 

(0.0)(0.1)  +  (0.3)  (0. 1)  +  (0.6)  (0. 1)  +  •  •  •+  (2.7)(0.1)  =  1.35 

centimeters,  ending  up  at  position  11.35.  This  is  a  better  approximation  than  10,  certainly,  but  is  still  just 
an  approximation.  (We  know  in  fact  that  the  object  ends  up  at  position  11.5,  because  we’ve  already  done 
the  problem  using  the  first  approach.)  Presumably,  we  will  get  a  better  approximation  if  we  divide  the 
time  into  one  hundred  intervals  of  a  hundredth  of  a  second  each,  and  repeat  the  process: 

(0.0)(0.01)  +  (0.03)  (0.01)  +  (0.06)(0.01)  +  •  •  •  +  (2.97)(0.01)  =  1.485. 

We  thus  approximate  the  position  as  1 1.485.  Since  we  know  the  exact  answer,  we  can  see  that  this  is  much 
closer,  but  if  we  did  not  already  know  the  answer,  we  wouldn’t  really  know  how  close. 
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We  can  keep  this  up,  but  we’ll  never  really  know  the  exact  answer  if  we  simply  compute  more  and 
more  examples.  Let’s  instead  look  at  a  “typical”  approximation.  Suppose  we  divide  the  time  into  n  equal 
intervals,  and  imagine  that  on  each  of  these  the  object  travels  at  a  constant  speed.  Over  the  first  time  in¬ 
terval  we  approximate  the  distance  traveled  as  (0.0)  (1/ft)  =  0,  as  before.  During  the  second  time  interval, 
from  t  —  1/nto  t  —  2/n,  the  object  travels  approximately  3(l/n)(l/n)  =  3/n2  centimeters.  During  time 
interval  number  i,  the  object  travels  approximately  (3(/  —  \  )fn){\fn)  —  3{i  —  \  )  /  n2  centimeters,  that  is, 
its  speed  at  time  (/  —  1 )/n,  3(i  —  l)/n,  times  the  length  of  time  interval  number  i,  l/n.  Adding  these  up  as 
before,  we  approximate  the  distance  traveled  as 

(0)  — b  3—  +  3(2)  —  +  3(3)  — ~  +  •  •  •  +  3{n  —  1)  — 
n  77“  nA  nA  nA 

centimeters.  What  can  we  say  about  this?  At  first  it  looks  rather  less  useful  than  the  concrete  calculations 
we’ve  already  done,  but  in  fact  a  bit  of  algebra  reveals  it  to  be  much  more  useful.  We  can  factor  out  a  3 
and  I/77 2  to  get 

— y  (0+1  +  2  +  3-I - \-(n  —  1)), 

nL 

that  is,  3/n2  times  the  sum  of  the  first  n  —  1  positive  integers.  Now  we  make  use  of  a  fact  you  may  have 
run  across  before,  Gauss’s  Equation: 


1  +2  +  3  -| - \-k 


k(k+l) 

2 


In  our  case  we’re  interested  in  k  =  n  —  1,  so 


1 +  2  +  3-1 - \-{n  —  1) 


(»-!)(») 

2 


—  77 


2 


This  simplifies  the  approximate  distance  traveled  to 


3  n2  —  n  3  772  —  77  3  f  n2  n\  3  /  1 

n2  2  In 2  2  \n2  n2  J  2  \  n 


Now  this  is  quite  easy  to  understand:  as  n  gets  larger  and  larger  this  approximation  gets  closer  and  closer 
to  (3/2)  (1  —  0)  —  3/2,  so  that  3/2  is  the  exact  distance  traveled  during  one  second,  and  the  final  position 
is  11.5. 

So  for  t  —  1,  at  least,  this  rather  cumbersome  approach  gives  the  same  answer  as  the  first  approach.  But 
really  there’s  nothing  special  about  t  —  1;  let’s  just  call  it  t  instead.  In  this  case  the  approximate  distance 
traveled  during  time  interval  number  i  is  3(i—  \)(t/n)(t/n)  —  3{i—  l)t2/n2,  that  is,  speed  3 (7  —  \){t/n) 
times  time  t/n,  and  the  total  distance  traveled  is  approximately 


(0) 77  +  3(1)— r  +  3(2)  — 2  +  3(3)  — y -I - h 3(?  1-  l)-^. 

77  n^  77  ^  77^  77  ^ 

As  before  we  can  simplify  this  to 


(0+  1  +2-| - b  (77  —  1)) 


In  the  limit,  as  n  gets  larger,  this  gets  closer  and  closer  to  (3/2)r  and  the  approximated  position  of  the 
object  gets  closer  and  closer  to  (3/2)t2+  10,  so  the  actual  position  is  (3/2)t2  +  10,  exactly  the  answer 
given  by  the  first  approach  to  the  problem.  X 
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Example  6.2:  Area  under  the  Line 


Find  the  area  under  the  curve  y  —  3x  between  x  —  0  and  any  positive  value  x. 


Solution.  There  is  here  no  obvious  analogue  to  the  first  approach  in  the  previous  example,  but  the  second 
approach  works  fine.  (Since  the  function  y  —  3x  is  so  simple,  there  is  another  approach  that  works  here,  but 
it  is  even  more  limited  in  potential  application  than  is  approach  number  one.)  How  might  we  approximate 
the  desired  area?  We  know  how  to  compute  areas  of  rectangles,  so  we  approximate  the  area  by  rectangles. 
Jumping  straight  to  the  general  case,  suppose  we  divide  the  interval  between  0  and  x  into  n  equal  subinter¬ 
vals,  and  use  a  rectangle  above  each  subinterval  to  approximate  the  area  under  the  curve.  There  are  many 
ways  we  might  do  this,  but  let’s  use  the  height  of  the  curve  at  the  left  endpoint  of  the  subinterval  as  the 
height  of  the  rectangle,  as  in  figure  6.1.  The  height  of  rectangle  number  i  is  then  3(i  —  1  )(x/n),  the  width 
is  x/n,  and  the  area  is  3  (i  —  1  )(x2/n2).  The  total  area  of  the  rectangles  is 


(°)^  +  3(1)^  +  3(2)^  +  3(3)^  +  --  +  3^-1)^- 

n  n  n-  n 


9/9 

By  factoring  out  3 x  /n  this  simplifies  to 


^(0+l+2  +  ---  +  (n— 1))  = 


3x2  n2  —  n  3 


=  —x  I  1  -  - 


n 


As  n  gets  larger  this  gets  closer  and  closer  to  3x2/2,  which  must  therefore  be  the  true  area  under  the  curve. 


Figure  6.1:  Approximating  the  area  under  y  =  3x  with  rectangles. 

What  you  will  have  noticed,  of  course,  is  that  while  the  problem  in  the  second  example  appears  to 
be  much  different  than  the  problem  in  the  first  example,  and  while  the  easy  approach  to  problem  one 
does  not  appear  to  apply  to  problem  two,  the  “approximation”  approach  works  in  both,  and  moreover  the 
calculations  are  identical.  As  we  will  see,  there  are  many,  many  problems  that  appear  much  different  on 
the  surface  but  turn  out  to  be  the  same  as  these  problems,  in  the  sense  that  when  we  try  to  approximate 
solutions  we  end  up  with  mathematics  that  looks  like  the  two  examples,  though  of  course  the  function 
involved  will  not  always  be  so  simple. 


222  Integration 


Even  better,  we  now  see  that  while  the  second  problem  did  not  appear  to  be  amenable  to  approach  one, 
it  can  in  fact  be  solved  in  the  same  way.  The  reasoning  is  this:  we  know  that  problem  one  can  be  solved 
easily  by  finding  a  function  whose  derivative  is  3 t.  We  also  know  that  mathematically  the  two  problems 
are  the  same,  because  both  can  be  solved  by  taking  a  limit  of  a  sum,  and  the  sums  are  identical.  Therefore, 
we  don’t  really  need  to  compute  the  limit  of  either  sum  because  we  know  that  we  will  get  the  same  answer 
by  computing  a  function  with  the  derivative  3t  or,  which  is  the  same  thing,  3x. 

It’s  true  that  the  first  problem  had  the  added  complication  of  the  “10”,  and  we  certainly  need  to  be  able 
to  deal  with  such  minor  variations,  but  that  turns  out  to  be  quite  simple.  The  lesson  then  is  this:  whenever 
we  can  solve  a  problem  by  taking  the  limit  of  a  sum  of  a  certain  form,  instead  of  computing  the  (often 
nasty)  limit  we  can  find  a  new  function  with  a  certain  derivative. 

6.1.1.  Riemann  Sums 
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A  fundamental  calculus  technique  is  to  first  answer  a  given  problem  with  an  approximation,  then  refine 
that  approximation  to  make  it  better,  then  use  limits  in  the  refining  process  to  find  the  exact  answer.  That 
is  exactly  what  we  will  do  here  to  develop  a  technique  to  find  the  area  of  more  complicated  regions. 

Consider  the  region  given  in  Figure  6.2,  which  is  the  area  under  y  =  4x  —  x2  on  [0,4].  What  is  the 
signed  area  of  this  region?  While  we  will  not  use  this  notation  in  this  section,  we  will  soon  see  that  this  is 
equivalent  to  finding  the  integral  given  by  Jq  (4x  —  x2)  dx, 


y 


Figure  6.2:  f(x)  —  4x  —  x2 


We  start  by  approximating.  We  can  surround  the  region  with  a  rectangle  with  height  and  width  of 
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4  and  find  the  area  is  approximately  16  square  units.  This  is  obviously  an  over-approximation',  we  are 
including  area  in  the  rectangle  that  is  not  under  the  parabola.  How  can  we  refine  our  approximation  to 
make  it  better?  The  key  to  this  section  is  this  answer:  use  more  rectangles. 

Let’s  use  four  rectangles  of  equal  width  of  1.  This  partitions  the  interval  [0,4]  into  four  subintervals, 
[0, 1],  [1,2],  [2,3]  and  [3,4].  On  each  subinterval  we  will  draw  a  rectangle. 

There  are  three  common  ways  to  determine  the  height  of  these  rectangles:  the  Left  Hand  Rule,  the 
Right  Hand  Rule,  and  the  Midpoint  Rule.  The  Left  Hand  Rule  says  to  evaluate  the  function  at  the  left- 
hand  endpoint  of  the  subinterval  and  make  the  rectangle  that  height.  In  Figure  6.3  below,  the  rectangle 
drawn  on  the  interval  [2, 3]  has  height  determined  by  the  Left  Hand  Rule;  it  has  a  height  of  /( 2)  =  4.  (The 
rectangle  is  labeled  “LHR.”) 


y 


Figure  6.3:  Approximating  area  using  rectangles 

The  Right  Hand  Rule  says  the  opposite:  on  each  subinterval,  evaluate  the  function  at  the  right  end¬ 
point  and  make  the  rectangle  that  height.  In  the  figure,  the  rectangle  drawn  on  [0, 1]  is  drawn  using  /( 1)  =  3 
as  its  height;  this  rectangle  is  labeled  “RHR.”. 

The  Midpoint  Rule  says  that  on  each  subinterval,  evaluate  the  function  at  the  midpoint  and  make  the 
rectangle  that  height.  The  rectangle  drawn  on  [1,2]  was  made  using  the  Midpoint  Rule,  with  a  height  of 
/( 1.5)  =  3.75.  That  rectangle  is  labeled  “MPR.” 

These  are  the  three  most  common  rules  for  determining  the  heights  of  approximating  rectangles,  but 
we  are  not  forced  to  use  one  of  these  three  methods.  The  rectangle  on  [3,4]  has  a  height  of  approximately 
/( 3.53),  very  close  to  the  Midpoint  Rule.  It  was  chosen  so  that  the  area  of  the  rectangle  is  exactly  the  area 
of  the  region  under  /  on  [3,4].  (Later  you’ll  be  able  to  figure  how  to  do  this,  too.) 

The  following  example  will  approximate  the  area  under  f(x)  —  4x  —  x2  using  these  rules. 


Example  6.3:  Using  the  Left  Hand,  Right  Hand  and  Midpoint  Rules 


Approximate  the  area  under  f(x)  =  Ax  —  x2  on  the  interval  [0,4]  using  the  Left  Hand  Rule,  the  Right 
Hand  Rule,  and  the  Midpoint  Rule,  using  four  equally  spaced  subintervals. 


Solution.  We  break  the  interval  [0,4]  into  four  subintervals  as  before.  In  Figure  6.4  we  see  four  rectangles 
drawn  on  f(x)  —  4x  —  x2  using  the  Left  Hand  Rule.  (The  areas  of  the  rectangles  are  given  in  each  figure.) 
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Figure  6.4:  Approximating  area  using  the  Left  Hand  Rule 

Note  how  in  the  first  subinterval,  [0, 1],  the  rectangle  has  height  /( 0)  =  0.  We  add  up  the  areas  of  each 
rectangle  (height x  width)  for  our  Left  Hand  Rule  approximation: 

/(0)  •  1  +/(1)  •  1  +/(2)  •  1  +/(3)  •  1  = 

0  +  3  +  4  +  3=  10. 

Figure  6.5  shows  four  rectangles  drawn  under  f(x)  using  the  Right  Hand  Rule;  note  how  the  [3,4]  subin¬ 
terval  has  a  rectangle  of  height  0. 


Y 


Figure  6.5:  Approximating  area  using  the  Right  Hand  Rule, 


In  this  figure,  these  rectangles  seem  to  be  the  mirror  image  of  those  found  in  Figure  6.4.  (This  is  because 
of  the  symmetry  of  our  shaded  region.)  Our  approximation  gives  the  same  answer  as  before,  though 
calculated  a  different  way: 


/(l) '  1  +/(2)  •  1  +/(3)  •  1  +/(4)  •  1  = 

3  +  4  +  3  +  0=  10. 


Figure  6.6  shows  four  rectangles  drawn  under  f(x)  using  the  Midpoint  Rule. 


6. 1 .  Displacement  and  Area  225 


Y 


Figure  6.6:  Approximating  area  using  the  Midpoint  Rule 

This  gives  an  approximation  of  the  area  as: 

/( 0.5)  •  1  +/(1.5)  •  1  +/(2.5)  •  1  +/(3.5)  •  1  = 

1.75  +  3.75  +  3.75  +  1.75  =  11. 

Our  three  methods  provide  two  approximations  of  the  area  under  f(x)  —  4x  —  x2:  10  and  11.  & 

It  is  hard  to  tell  at  this  moment  which  is  a  better  approximation:  10  or  11?  We  can  continue  to  refine 
our  approximation  by  using  more  rectangles.  The  notation  can  become  unwieldy,  though,  as  we  add  up 
longer  and  longer  lists  of  numbers.  We  introduce  summation  notation  (also  called  sigma  notation)  to 
solve  this  problem. 

Suppose  we  wish  to  add  up  a  list  of  numbers  a\,  aj,  <33,  . . . ,  <29.  Instead  of  writing 

Cl\  +  CI2  +  @3  +  a4  T  ^5  T  a6  T  Cl~]  +  $8  +  £?9, 
we  use  summation  notation  and  write 

9 

i=l 

The  upper  case  sigma  (£)  represents  the  term  “sum”  and  at  is  referred  to  as  the  summand.  The  index 
of  summation  in  this  example  is  i;  any  symbol  can  be  used.  By  convention,  the  index  takes  on  only  the 
integer  values  between  (and  including)  the  lower  and  upper  bounds,  here  equal  to  1  and  9  respectively. 
Let’s  practice  using  this  notation. 


Solution. 
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1. 

6 

^  Cli  —  Cl\  +  Cl2  +  +  £4  +  6(5  +  <26 

Z=1 

=  1  +  3  +  5  +  7  +  9+11 
=  36. 

2.  Note  the  starting  value  is  different  than  1: 

7 

^  (3  cii  —  4)  =  (3tf3  —  4)  +  (3«4  —  4)  +  (3«5  —  4)  +  (3«6  —  4)  +  (3aj  —  4) 

i= 3 

=  11  +  17  +  23  +  29  +  35 
-  115. 


^(a,-)2  —  (at)2  +  (02)  +  (+3)2  +  (+4) 
j=t 

=  l2  +  32  +  52  +  72 
-84 


X 

The  following  theorem  gives  some  properties  of  summations  that  allow  us  to  work  with  them  without 
writing  individual  terms.  Examples  will  follow. 
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Solution. 


£(2i— l)  =  £2i— £(l) 
1=1  i—  1  i=  1 


6(6+1) 

2 


=  42-6  =  36 


We  obtained  the  same  answer  without  writing  out  all  six  terms.  When  dealing  with  small  values  of 
n,  it  may  be  faster  to  write  the  terms  out  by  hand.  However,  Theorem  6.5  is  incredibly  important  when 
dealing  with  large  sums  as  we’ll  soon  see.  X 

Consider  again  /(x )  =  4x  — x2.  We  will  approximate  the  area  under  this  curve  (again  for  [0,4]  using 
sixteen  equally  spaced  subintervals  and  the  Right  Hand  Rule.  Before  doing  so,  we  will  do  some  careful 
preparation. 


0  12  3  4 

Xi  X5  x9  Xi3  xi7 


Figure  6.7:  Dividing  [0,4]  into  16  equally  spaced  subintervals 

Figure  6.7  shows  a  number  line  of  [0,4]  divided  into  sixteen  equally  spaced  subintervals.  We  denote  0 
as  x\ ;  we  have  marked  the  values  of  X5,  X9,  X13  and  X17.  We  could  mark  them  all,  but  the  figure  would  get 
crowded.  While  it  is  easy  to  figure  that  x  1 0  =  2.25,  in  general,  we  want  a  method  of  determining  the  value 
of  Xj  without  consulting  the  figure.  Consider: 


Xi 

=  x  1  +  ( /  —  1 )  At 

where 

xi 

:  starting  value 

(i-1) 

:  number  of  subintervals  between  x\  and  x, 

At 

:  subinterval  width 

So  xio  =  xi  +9(4/16)  =  2.25. 

If  we  had  partitioned  [0,4]  into  100  equally  spaced  subintervals,  each  subinterval  would  have  length 
At  =  4/100  =  0.04.  We  could  compute  X32  as 

x32  =  0  +  31(4/100)  =  1.24. 

(That  was  far  faster  than  creating  a  sketch  first.) 

Given  any  subdivision  of  [0,4],  the  first  subinterval  is  [xi  ,x2];  the  second  is  [x2,X3];  the  i th  subinterval 
is  [x/,Xj+i], 
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When  using  the  Left  Hand  Rule,  the  height  of  the  i th  rectangle  will  be  /(x,-). 

When  using  the  Right  Hand  Rule,  the  height  of  the  1th  rectangle  will  be  /'(x/+i ). 

When  using  the  Midpoint  Rule,  the  height  of  the  i th  rectangle  will  be  / 

Thus  approximating  the  area  under  f(x)  =  4x  —  x2  on  [0,4]  with  sixteen  equally  spaced  subintervals 
can  be  expressed  as  follows,  where  Ax  =  4/16  =  1/4: 

16 

Left  Hand  Rule:  £  /(x,)  Ax 
1=1 
16 

Right  Hand  Rule:  £/(x,-+i)Av 

i=  1 

Midpoint  Rule:  £/  1  1  j  Ax 

We  use  these  formulas  in  the  following  example. 


^X/+X/+l^ 


Example  6.7:  Approximating  Area  Using  Sums 


Approximate  the  area  under  f(x)  —  4x  — x2  on  [0,4]  using  the  Right  Hand  Rule  and  summation 
formulas  with  sixteen  and  1000  equally  spaced  intervals. 


Solution.  Using  sixteen  equally  spaced  intervals  and  the  Right  Hand  Rule,  we  can  approximate  the  area 
as 

16 

£/(*/+  i)Ax. 

/=i 

We  have  Ax  =  4/16  =  0.25.  Since  x,-  =  0  +  (z  —  l)Ax,  we  have 

x,-+i  =  0  +  ((z  +  1)  —  l)  Ax 
=  zAx 


Using  the  summation  formulas,  consider: 


16  16 

Y  /(*«+  i  =  Y 

i=  1  (=  1 

16 

=  £  (4z‘Ax—  (zAx)2)Ar 

i=  1 
16 

=  Y  (4zAx2  —  r  Ac1  ) 
i=  l 

16  16 

=  (4Ax2)£z-Ax3£z2 
=  (4Av2)^-Ar3Wz)M 


(6.1) 


2 


6 
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=  4-0.252- 136  -0.253- 1496 
=  10.625 

We  were  able  to  sum  up  the  areas  of  sixteen  rectangles  with  very  little  computation.  In  Figure  6.8  the 
function  and  the  sixteen  rectangles  are  graphed.  While  some  rectangles  over-approximate  the  area,  other 
under-approximate  the  area  (by  about  the  same  amount).  Thus  our  approximate  area  of  10.625  is  likely  a 
fairly  good  approximation. 

y 


Figure  6.8:  Approximating  area  with  the  Right  Hand  Rule  and  16  evenly  spaced  subintervals 

Notice  Equation  (6.1);  by  changing  the  16’s  to  1,000’s  (and  appropriately  changing  the  value  of  Ax), 
we  can  use  that  equation  to  sum  up  1000  rectangles! 

We  do  so  here,  skipping  from  the  original  summand  to  the  equivalent  of  Equation  (6.1)  to  save  space. 
Note  that  Av  =  4/ 1000  =  0.004. 

1000  1000  1000 

£  /(-*/+i)Ax  =  (4Ax2)  £  /  -Ax3  £  i2 

1=1  1=1  1=1 

_  (AkJz^  1000-1001  A  3  1000(1001) (2001) 

—  (4AxJ  -  Ax  “ 

=  4  •  0.0042  •  500500  -  0.0043  •  333, 833, 500 
=  10.666656 

Using  many,  many  rectangles,  we  have  a  likely  good  approximation  of  the  area  under  f(x)  —  4x  — x2 
of  «  10.666656.  4 

Before  the  above  example,  we  stated  the  summations  for  the  Left  Hand,  Right  Hand  and  Midpoint 
Rules.  Each  had  the  same  basic  structure,  which  was: 

1 .  each  rectangle  has  the  same  width,  which  we  referred  to  as  Av,  and 

2.  each  rectangle’s  height  is  determined  by  evaluating  /(x)  at  a  particular  point  in  each  subinterval. 
For  instance,  the  Left  Hand  Rule  states  that  each  rectangle’s  height  is  determined  by  evaluating  /(x) 
at  the  left  hand  endpoint  of  the  subinterval  the  rectangle  lives  on. 

One  could  partition  an  interval  [a,b\  with  subintervals  that  did  not  have  the  same  width.  We  refer  to 
the  length  of  the  first  subinterval  as  Avj,  the  length  of  the  second  subinterval  as  A.vy ,  and  so  on,  giving  the 
length  of  the  / th  subinterval  as  Ax,-.  Also,  one  could  determine  each  rectangle’s  height  by  evaluating  /(x)  at 
any  point  in  the  / th  subinterval.  We  refer  to  the  point  picked  in  the  first  subinterval  as  ci,  the  point  picked 
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in  the  second  subinterval  as  C2,  and  so  on,  with  c,  representing  the  point  picked  in  the  i th  subinterval.  Thus 
the  height  of  the  i th  subinterval  would  be  /(c,-),  and  the  area  of  the  i th  rectangle  would  be  /(c,-)  Ax,-. 

Summations  of  rectangles  with  area  /(c,  )  Ax,  are  named  after  mathematician  Georg  Friedrich  Bernhard 
Riemann,  as  given  in  the  following  definition. 


Y 


Figure  6.9:  General  Riemann  sum  to  approximate  the  area  under  f(x)  —4x  —  x2 


Figure  6.9  shows  the  approximating  rectangles  of  a  Riemann  sum.  While  the  rectangles  in  this  example 
do  not  approximate  well  the  shaded  area,  they  demonstrate  that  the  subinterval  widths  may  vary  and  the 
heights  of  the  rectangles  can  be  determined  without  following  a  particular  rule. 

Riemann  sums  are  typically  calculated  using  one  of  the  three  rules  we  have  introduced.  The  uniformity 
of  construction  makes  computations  easier.  Before  working  another  example,  let’s  summarize  some  of 
what  we  have  learned  in  a  convenient  way. 
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Riemann  Sums 


Consider  a  function  f(x)  defined  on  an  interval  [a,  b] .  The  area  under  this  curve  is  approximated  by 

Lun^Axi. 


1 .  When  the  n  subintervals  have  equal  length,  A jq  =  Ar  = - . 

n 

2.  The  i th  term  of  the  partition  is  xt  —  a  +  (i  —  l)Ar.  (This  makes  xn+\  —  b.) 

n 

3.  The  Left  Hand  Rule  summation  is:  ^  f(x;)Ax. 

i=  1 
n 

4.  The  Right  Hand  Rule  summation  is:  ^  fUi\  i )  A_v . 

i=  1 
n 

5.  The  Midpoint  Rule  summation  is:  ^  / 

1=1 


Xi+Xx+ 1 


Ar. 


Let’s  do  another  example. 


Example  6.9:  Approximating  Area  Using  Sums 


Approximate  the  area  under  fix)  —  (5v  +  2)  on  the  interval  [—2, 3]  using  the  Midpoint  Rule  and  ten 
equally  spaced  intervals. 


Solution  Following  the  above  discussion,  we  have 

Xi  —  (—2)  +  (l/2)0‘—  1)  =  i/2  —  5/2. 

As  we  are  using  the  Midpoint  Rule,  we  will  also  need  jc,-+ 1  and  —  ^A,+1 


.  Since  Xj  —  i/2  —  5/2,  Xj+\  = 


(/+  l)/2  —  5 /2  =  if  2  —  2.  This  gives 


*«■  +  *»■+!  =  (f/2- 5/2)  + (i/2-2)  =  i  —  9/2  =  j/2_g/^ 

We  now  construct  the  Riemann  sum  and  compute  its  value  using  summation  formulas. 


to 


Lf 


Xi+xi+ 1 


Ar 


10 

£/(«'/ 2  — 9/4)  Ax 


i=  1 

10 


£(5(i/2-9/4)+2)Av 

(=1 


10 


i=  1 


2 
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Note  the  graph  of  f(x)  —  5x  +  2  in  Figure  6.10.  The  regions  whose  areas  are  computed  are  triangles, 
meaning  we  can  find  the  exact  answer  without  summation  techniques.  We  find  that  the  exact  answer  is 
indeed  22.5.  One  of  the  strengths  of  the  Midpoint  Rule  is  that  often  each  rectangle  includes  area  that  should 
not  be  counted,  but  misses  other  area  that  should.  When  the  partition  width  is  small,  these  two  amounts 
are  about  equal  and  these  errors  almost  “cancel  each  other  out.”  In  this  example,  since  our  function  is  a 
line,  these  errors  are  exactly  equal  and  they  do  cancel  each  other  out,  giving  us  the  exact  answer. 

y 


Figure  6.10:  Approximating  area  using  the  Midpoint  Rule  and  10  evenly  spaced  subintervals 

Note  too  that  when  the  function  is  negative,  the  rectangles  have  a  “negative”  height.  When  we  compute 
the  area  of  the  rectangle,  we  use  /(c,-)Ar;  when  /  is  negative,  the  area  is  counted  as  negative.  & 

Notice  in  the  previous  example  that  while  we  used  ten  equally  spaced  intervals,  the  number  “10”  didn’t 
play  a  big  role  in  the  calculations  until  the  very  end.  Mathematicians  love  abstract  ideas;  let’s  approximate 
the  area  of  another  region  using  n  subintervals,  where  we  do  not  specify  a  value  of  n  until  the  very  end. 


Example  6.10:  Approximating  Area  Using  Sums 


Revisit  f(x)  —  4x  —  x2  on  the  interval  [0,4,]  yet  again.  Approximate  the  area  under  this  curve  using 
the  Right  Hand  Rule  with  n  equally  spaced  subintervals. 


Solution.  We  know  Ax  —  =  4/n.  We  also  find  jt;  =  0  +  At (i  —  1)  =  4(i  —  1  )/n.  The  Right  Hand  Rule 

uses  jCf+i,  which  is  jc,-+  i  =  4 i/n.  We  construct  the  Right  Hand  Rule  Riemann  sum  as  follows. 


i= l 


£  /(*/+  i)Ax  =£/(  —  )  At 

n 

=  E 


i=  1 


i=  1 


4—  —  (  — 
n 


At 


16  At 


1=1 


16  At 
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16  Ax 


n 


I-'- 

i=l 


16  Ax 


I- 

i=l 


•2 


16Ax\  n(n  +  1)  /  16Ax\  n(n+  l)(2n+  1) 


/i 


32(n+ 1)  32(n+l)(2n  +  l) 


3/r2 


(now  simplify) 


> 


/  recall 
l  Aa  =  4/w 


The  result  is  an  amazing,  easy  to  use  formula.  To  approximate  the  area  with  ten  equally  spaced 
subintervals  and  the  Right  Hand  Rule,  set  n  =  10  and  compute 


32 

y 


10.56. 


Recall  how  earlier  we  approximated  the  area  with  4  subintervals;  with  n  =  4,  the  formula  gives  10,  our 
answer  as  before. 

It  is  now  easy  to  approximate  the  area  with  1,000,000  subintervals!  Hand-held  calculators  will  round 
off  the  answer  a  bit  prematurely  giving  an  answer  of  10.66666667.  (The  actual  answer  is  10.666666666656.) 

We  now  take  an  important  leap.  Up  to  this  point,  our  mathematics  has  been  limited  to  geometry  and 
algebra  (finding  areas  and  manipulating  expressions).  Now  we  apply  calculus.  For  any  finite  n,  we  know 
that  the  corresponding  Right  Hand  Rule  Riemann  sum  is: 


32 

y 


Both  common  sense  and  high-level  mathematics  tell  us  that  as  n  gets  large,  the  approximation  gets  better. 
In  fact,  if  we  take  the  limit  as  n  — »  oo,  we  get  the  exact  area.  That  is, 


lim 

n—>  oo 


32 

y 


32 

y 

32 

y 


(1-0) 
=  10.6 


This  is  a  fantastic  result.  By  considering  n  equally-spaced  subintervals,  we  obtained  a  formula  for  an 
approximation  of  the  area  that  involved  our  variable  n.  As  n  grows  large  -  without  bound  -  the  error 
shrinks  to  zero  and  we  obtain  the  exact  area.  4k 


This  section  started  with  a  fundamental  calculus  technique:  make  an  approximation,  refine  the  approx¬ 
imation  to  make  it  better,  then  use  limits  in  the  refining  process  to  get  an  exact  answer.  That  is  precisely 
what  we  just  did. 

Let’s  practice  this  again. 


Example  6.11:  Approximating  Area  With  a  Formula,  Using  Sums 


Find  a  formula  that  approximates  the  area  under  f(x)  =  x3  on  the  interval  [—1,5]  using  the  Right 
Hand  Rule  and  n  equally  spaced  subintervals,  then  take  the  limit  as  n  — >■  °o  to  find  the  exact  area. 
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Solution.  We  have  Ax  =  5  ^  ^  =  6/n.  We  have  x,-  =  (— 1)  +  (i  —  l)Ax;  as  the  Right  Hand  Rule  uses  x;+i, 
we  have  x,+i  =  (  — 1)  +  /Ax. 

The  Riemann  sum  corresponding  to  the  Right  Hand  Rule  is  (followed  by  simplifications): 


i= l 


52  /(•*«+ 1  )Ax = 52  /(- 1 + i'Ax)Ax 

i=  1 
n 

=  52(-l  +  iAx)3  Ax 
i=  1 
n 

—  J2  ((/Ar)3  —  3(/Av)2  +  3/Ax  —  l)  Ax 

/—  l 
n 

=  £  (*3Ax4  —  3r  Ax3  +  3 /Ax2  —  Ax) 

i=  1 

=  Ax4  £ ,3  —  3  Ar3  £  ‘2  +  3  Ax2  52 z  —  52  ^ 


(=i 


i=l 


i=1  i=l 


=  Ax4 


4  / n(n  +  1) 


3Ai3”("+1)(2”+1)+3Ax2”("+1) -nAx 


1296  ?72(n+l)2  216  n(n  + l)(2n+ 1)  36/i(«H- 1) 


-3 


+  3- 


-6 


n 


378  216 

—  156  H - 1 - =- 


n  n- 


Once  again,  we  have  found  a  compact  formula  for  approximating  the  area  with  n  equally  spaced  subin¬ 
tervals  and  the  Right  Hand  Rule.  Using  ten  subintervals,  we  have  an  approximation  of  195.96  (these 
rectangles  are  shown  in  Figure  6.11).  Using  n  —  100  gives  an  approximation  of  159.802. 

Now  find  the  exact  answer  using  a  limit: 


lim 

n— »°o 


(l56  + 


378 
- + 


n 


156. 


X 


Y 


Figure  6.11:  Approximating  area  using  the  Right  Hand  Rule  and  10  evenly  spaced  subintervals. 
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We  have  used  limits  to  evaluate  exactly  given  definite  limits.  Will  this  always  work?  We  will  show, 
given  not-very-restrictive  conditions,  that  yes,  it  will  always  work. 

The  previous  two  examples  demonstrated  how  an  expression  such  as 

n 

£/(*i+l)  A* 

i=  1 

can  be  rewritten  as  an  expression  explicitly  involving  n,  such  as  32/3(1  —  1  /n2). 

Viewed  in  this  manner,  we  can  think  of  the  summation  as  a  function  of  n.  An  n  value  is  given  (where  n 
is  a  positive  integer),  and  the  sum  of  areas  of  n  equally  spaced  rectangles  is  returned,  using  the  Left  Hand, 
Right  Hand,  or  Midpoint  Rules. 

Given  a  function  /(x)  defined  on  the  interval  [a,  b\  let: 


n 

•  S/(n)  =  52  f(xj)  Ax,  the  sum  of  equally  spaced  rectangles  formed  using  the  Left  Hand  Rule, 

i=  1 


*  SR(n) 

•  SM(n ) 

Rule. 


=  52 )  Ax,  the  sum  of  equally  spaced  rectangles  formed  using  the  Right  Hand  Rule,  and 

i=  1 

n 

=  E  f 

i=  1 


X^+Xi+A 

2 


Ax,  the  sum  of  equally  spaced  rectangles  formed  using  the  Midpoint 


Recall  the  definition  of  a  limit  as  n  — >  lim  SR(n)  —  K  if,  given  any  e  >  0,  there  exists  8  >  0  such 

n— >oo 

that  | Si{n)  —K\  <  £  whenever  n  >  8. 

The  following  theorem  states  that  we  can  use  any  of  our  three  rules  to  find  the  exact  value  of  the  area 
under  /(x)  on  [a,  b\ .  It  also  goes  two  steps  further.  The  theorem  states  that  the  height  of  each  rectangle 
doesn’t  have  to  be  determined  following  a  specific  rule,  but  could  be  /(c,-),  where  c,  is  any  point  in  the  i th 
subinterval,  as  discussed  earlier. 


The  theorem  goes  on  to  state  that  the  rectangles  do  not  need  to  be  of  the  same  width.  Using  the 
notation  of  Definition  6.8,  let  Axj  denote  the  length  of  the  /th  subinterval  in  a  partition  of  [a,b\.  Now  let 
|  |Ax|  |  represent  the  length  of  the  largest  subinterval  in  the  partition:  that  is,  1 1  Ax|  |  is  the  largest  of  all  the 
Ax/s.  If  ||Ax||  is  small,  then  [a,b\  must  be  partitioned  into  many  subintervals,  since  all  subintervals  must 
have  small  lengths.  “Taking  the  limit  as  |  |Ax|  |  goes  to  zero”  implies  that  the  number  n  of  subintervals  in 
the  partition  is  growing  to  infinity,  as  the  largest  subinterval  length  is  becoming  arbitrarily  small.  We  then 
interpret  the  expression 

n 


lim  Y 


f(ci)Axi 


as  “the  limit  of  the  sum  of  rectangles,  where  the  width  of  each  rectangle  can  be  different  but  getting  small, 
and  the  height  of  each  rectangle  is  not  necessarily  determined  by  a  particular  rule.”  The  following  theorem 
states  that,  for  a  sufficiently  nice  function,  we  can  use  any  of  our  three  rules  to  find  the  area  under  /(x) 
over  [a,  b] . 
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We  summarize  what  we  have  learned  over  the  past  few  sections  here. 

•  Knowing  the  “area  under  the  curve”  can  be  useful.  One  common  example  is:  the  area  under  a 
velocity  curve  is  displacement. 

•  While  we  can  approximate  the  area  under  a  curve  in  many  ways,  we  have  focused  on  using  rectangles 
whose  heights  can  be  determined  using:  the  Left  Hand  Rule,  the  Right  Hand  Rule  and  the  Midpoint 
Rule. 

•  Sums  of  rectangles  of  this  type  are  called  Riemann  sums. 

•  The  exact  value  of  the  area  can  be  computed  using  the  limit  of  a  Riemann  sum.  We  generally  use 
one  of  the  above  methods  as  it  makes  the  algebra  simpler. 


Exercises  for  Section  6.1 


Exercise  6.1.1  Suppose  an  object  moves  in  a  straight  line  so  that  its  speed  at  time  t  is  given  by  v(t)  — 
2 1  +  2,  and  that  at  t  —  1  the  object  is  at  position  5.  Find  the  position  of  the  object  at  t  —  2. 

Exercise  6.1.2  Suppose  an  object  moves  in  a  straight  line  so  that  its  speed  at  time  t  is  given  by  v(t)  —  t2  +  2, 
and  that  at  t  —  0  the  object  is  at  position  5.  Find  the  position  of  the  object  all  —  2. 

Exercise  6.1.3  Find  the  area  under  y  —  2x  between  x  —  0  and  any  positive  value  for  x. 

Exercise  6.1.4  Find  the  area  under  y  —  4x  between  x  =  0  and  any  positive  value  for  x. 

Exercise  6.1.5  Find  the  area  under  y  —  4x  between  x  —  2  and  any  positive  value  for  x  bigger  than  2. 

Exercise  6.1.6  Find  the  area  under  y  —  Ax  between  any  two  positive  values  for  x,  say  a  <  b. 
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Exercise  6.1.7  Let  f(x)  —  x2  +  3x  +  2.  Approximate  the  area  under  the  cur\’e  between  x  —  0  and  x  —  2 
using  4  rectangles  and  also  using  8  rectangles. 

Exercise  6.1.8  Let  f(x)  —  x2  —  2x  +  3.  Approximate  the  area  under  the  cur\>e  between  x  =  1  and  x  —  3 
using  4  rectangles. 
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We  begin  by  exploring  an  example.  Suppose  that  an  object  moves  in  a  straight  line  so  that  its  speed  is  3 1 
at  time  t.  How  far  does  the  object  travel  between  time  t  —  a  and  time  t  —  bl  We  don’t  assume  that  we 
know  where  the  object  is  at  time  t  —  0  or  at  any  other  time.  It  is  certainly  true  that  it  is  somewhere,  so 
let’s  suppose  that  at  t  —  0  the  position  is  k.  Then  we  know  that  the  position  of  the  object  at  any  time  is 
3t2/2  +  k.  This  means  that  at  time  t  —  a  the  position  is  3a2 /2  +  k  and  at  time  t  —  b  the  position  is  3b2 /2  +  k. 
Therefore  the  change  in  position  is  3b2 /2  +  k—  (3a2 /2  +  k)  —  3b2 / 2  — 3a2 / 2.  Notice  that  the  k  drops  out; 
this  means  that  it  doesn’t  matter  that  we  don’t  know  k,  it  doesn’t  even  matter  if  we  use  the  wrong  k,  we  get 
the  correct  answer. 

What  about  a  second  approach  to  this  problem?  We  now  want  to  approximate  the  change  in  position 
between  time  a  and  time  b.  We  take  the  interval  of  time  between  a  and  b,  divide  it  into  n  subintervals, 
and  approximate  the  distance  traveled  during  each.  The  starting  time  of  subinterval  number  i  is  now 
a  +  (i  —  1 )  (b  —  a)/n,  which  we  abbreviate  as  i ,  so  that  to  —  a,t\  =  a  +  (b  —  a)/n,  and  so  on.  The  speed 
of  the  object  is  f(t)  —  3 1,  and  each  subinterval  is  (b  —  a)/n  —  At  seconds  long.  The  distance  traveled 
during  subinterval  number  i  is  approximately  and  the  total  change  in  distance  is  approximately 

f(to)At  +  f(ti)At  H - \-  f(tn-i)At . 


The  exact  change  in  position  is  the  limit  of  this  sum  as  n  goes  to  infinity.  We  abbreviate  this  sum  using 

sigma  notation: 

n— 1 

Y  fib)  &  =  f(to)At  +  f(t\)At  H - \-f(tn-i)At. 

i= 0 

The  notation  on  the  left  side  of  the  equal  sign  uses  a  large  capital  sigma,  a  Greek  letter,  and  the  left  side  is 
an  abbreviation  for  the  right  side.  The  answer  we  seek  is 


LI —  1 


lim  Y  fib) At. 

n — Voo 


i=0 


Since  this  must  be  the  same  as  the  answer  we  have  already  obtained,  we  know  that 


n—  1 

lim  Y  fib)  At 

i=() 


3b2 

~Y 


3a2 

~2' 


The  significance  of  3t2 / 2,  into  which  we  substitute  t  —  b  and  t  —  a,  is  of  course  that  it  is  a  function  whose 
derivative  is  f(t).  As  we  have  discussed,  by  the  time  we  know  that  we  want  to  compute 
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it  no  longer  matters  what  f(t)  stands  for — it  could  be  a  speed,  or  the  height  of  a  curve,  or  something  else 
entirely.  We  know  that  the  limit  can  be  computed  by  finding  any  function  with  derivative  f(t),  substituting 
a  and  b,  and  subtracting.  We  summarize  this  in  a  theorem.  First,  we  introduce  some  new  notation  and 
terms. 

We  write 

[  fit)  dt  =  lim  £ 

if  the  limit  exists.  That  is,  the  left  hand  side  means,  or  is  an  abbreviation  for,  the  right  hand  side.  The 
symbol  /  is  called  an  integral  sign,  and  the  whole  expression  is  read  as  “the  integral  of  f(t)  from  a  to 
b'd  What  we  have  learned  is  that  this  integral  can  be  computed  by  finding  a  function,  say  F(t),  with  the 
property  that  F\t)  —  f(t),  and  then  computing  F(b)  —  F(a).  The  function  F(t)  is  called  an  antiderivative 
of  f(t).  Now  the  theorem: 


Let’s  rewrite  this  slightly: 

f  f(t)dt  —  F{x)  —  F(a). 

Ja 

We’ve  replaced  the  variable  x  by  t  and  b  by  x.  These  are  just  different  names  for  quantities,  so  the 
substitution  doesn’t  change  the  meaning.  It  does  make  it  easier  to  think  of  the  two  sides  of  the  equation  as 
functions.  The  expression 

[  fit)dt 

Ja 

is  a  function:  plug  in  a  value  for  x,  get  out  some  other  value.  The  expression  F(x)  — F{a )  is  of  course  also 
a  function,  and  it  has  a  nice  property: 

^(F(x)-F(a))  =  F'{x)  =  f{x), 

since  F(a)  is  a  constant  and  has  derivative  zero.  In  other  words,  by  shifting  our  point  of  view  slightly,  we 
see  that  the  odd  looking  function 

G(x)=  f  f{t)dt 

Ja 

has  a  derivative,  and  that  in  fact  G'{x)  =  f(x) .  This  is  really  just  a  restatement  of  the  Fundamental  Theorem 
of  Calculus,  and  indeed  is  often  called  the  Fundamental  Theorem  of  Calculus.  To  avoid  confusion,  some 
people  call  the  two  versions  of  the  theorem  “The  Fundamental  Theorem  of  Calculus,  part  I”  and  “The 
Fundamental  Theorem  of  Calculus,  part  II”,  although  unfortunately  there  is  no  universal  agreement  as  to 
which  is  part  I  and  which  part  II.  Since  it  really  is  the  same  theorem,  differently  stated,  some  people  simply 
call  them  both  “The  Fundamental  Theorem  of  Calculus.” 
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We  have  not  really  proved  the  Fundamental  Theorem.  In  a  nutshell,  we  gave  the  following  argument 
to  justify  it:  Suppose  we  want  to  know  the  value  of 

[  f(t)dt=  lim  £/(f;)Af. 

We  can  interpret  the  right  hand  side  as  the  distance  traveled  by  an  object  whose  speed  is  given  by  /(f).  We 
know  another  way  to  compute  the  answer  to  such  a  problem:  find  the  position  of  the  object  by  finding  an 
antiderivative  of  f(t),  then  substitute  t  —  a  and  t  =  b  and  subtract  to  find  the  distance  traveled.  This  must 
be  the  answer  to  the  original  problem  as  well,  even  if  f(t)  does  not  represent  a  speed. 

What’s  wrong  with  this?  In  some  sense,  nothing.  As  a  practical  matter  it  is  a  very  convincing  argument, 
because  our  understanding  of  the  relationship  between  speed  and  distance  seems  to  be  quite  solid.  From 
the  point  of  view  of  mathematics,  however,  it  is  unsatisfactory  to  justify  a  purely  mathematical  relationship 
by  appealing  to  our  understanding  of  the  physical  universe,  which  could,  however  unlikely  it  is  in  this  case, 
be  wrong. 

A  complete  proof  is  a  bit  too  involved  to  include  here,  but  we  will  indicate  how  it  goes.  First,  if  we  can 
prove  the  second  version  of  the  Fundamental  Theorem,  Theorem  6.14,  then  we  can  prove  the  first  version 
from  that: 

Proof.  Proof  of  Theorem  6.13. 

We  know  from  Theorem  6.14  that 

G{x)  =  [  f(t)dt 
Ja 

is  an  antiderivative  of  f(x),  and  therefore  any  antiderivative  F(x)  of  fix)  is  of  the  form  F(x)  —  G(x)  +  k. 
Then 


F(b)  —F(a)  =  G[b)  +  k—  (G(a)  +k) 


G(b)  -  G(a) 

rb 

/  f{t)dt- 

Ja 


It  is  not  hard  to  see  that  /  f(t)  dt  —  0,  so  this  means  that 

Ja 


F(b)  —F(a)  =  [  f{t)dt, 
Ja 


which  is  exactly  what  Theorem  6.13  says. 


* 


So  the  real  job  is  to  prove  Theorem  6.14.  We  will  sketch  the  proof,  using  some  facts  that  we  do  not 
prove.  First,  the  following  identity  is  true  of  integrals: 


/ c  m  dt+  fm 

Ja  J  c 


dt. 
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This  can  be  proved  directly  from  the  definition  of  the  integral,  that  is,  using  the  limits  of  sums.  It  is  quite 
easy  to  see  that  it  must  be  true  by  thinking  of  either  of  the  two  applications  of  integrals  that  we  have  seen. 
It  turns  out  that  the  identity  is  true  no  matter  what  c  is,  but  it  is  easiest  to  think  about  the  meaning  when 
a  <c  <b. 

First,  if  f(t)  represents  a  speed,  then  we  know  that  the  three  integrals  represent  the  distance  traveled 
between  time  a  and  time  b;  the  distance  traveled  between  time  a  and  time  c;  and  the  distance  traveled 
between  time  c  and  time  b.  Clearly  the  sum  of  the  latter  two  is  equal  to  the  first  of  these. 

Second,  if  f(t)  represents  the  height  of  a  curve,  the  three  integrals  represent  the  area  under  the  curve 
between  a  and  b\  the  area  under  the  curve  between  a  and  c;  and  the  area  under  the  curve  between  c  and  b. 
Again  it  is  clear  from  the  geometry  that  the  first  is  equal  to  the  sum  of  the  second  and  third. 

Proof.  Proof  of  Theorem  6. 14. 

We  want  to  compute  G'{x),  so  we  start  with  the  definition  of  the  derivative  in  terms  of  a  limit: 


G'W 


G(x  +  Ax)  —  G(x) 

Ax->0  Ax 

1  /  /**+Ax  rx  \ 

A x\Sa  f^dt-jaf^dt) 

|  /  rx  rx+Ax 

JSa x{lmd,  +  l  md,~ 

]  rx+Ax 

AlimnA~  /  /(*)*• 

Ax — ^0  Ax  J x 


Now  we  need  to  know  something  about 


when  Ax  is  small;  in  fact,  it  is  very  close  to  Axf(x),  but  we  will  not  prove  this.  Once  again,  it  is  easy  to 
believe  this  is  true  by  thinking  of  our  two  applications:  The  integral 


rx+Ax 

/  fit)dt 

Jx 

can  be  interpreted  as  the  distance  traveled  by  an  object  over  a  very  short  interval  of  time.  Over  a  sufficiently 
short  period  of  time,  the  speed  of  the  object  will  not  change  very  much,  so  the  distance  traveled  will  be 
approximately  the  length  of  time  multiplied  by  the  speed  at  the  beginning  of  the  interval,  namely,  Axf(x). 
Alternately,  the  integral  may  be  interpreted  as  the  area  under  the  curve  between  x  and  x  +  Ax  When  Ax  is 
very  small,  this  will  be  very  close  to  the  area  of  the  rectangle  with  base  Ax  and  height  f'(x) ;  again  this  is 
A xf{x).  If  we  accept  this,  we  may  proceed: 


lim  — 

Ax->0  Ax 


lim 

Ax— >0 


Ax. f(x) 
Ax 


/(*). 


which  is  what  we  wanted  to  show. 


4 


It  is  still  true  that  we  are  depending  on  an  interpretation  of  the  integral  to  justify  the  argument,  but 
we  have  isolated  this  part  of  the  argument  into  two  facts  that  are  not  too  hard  to  prove.  Once  the  last 
reference  to  interpretation  has  been  removed  from  the  proofs  of  these  facts,  we  will  have  a  real  proof  of 
the  Fundamental  Theorem. 
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Now  we  know  that  to  solve  certain  kinds  of  problems,  those  that  lead  to  a  sum  of  a  certain  form,  we 
“merely”  find  an  antiderivative  and  substitute  two  values  and  subtract.  Unfortunately,  finding  antideriva¬ 
tives  can  be  quite  difficult.  While  there  are  a  small  number  of  rules  that  allow  us  to  compute  the  derivative 
of  any  common  function,  there  are  no  such  rules  for  antiderivatives.  There  are  some  techniques  that  fre¬ 
quently  prove  useful,  but  we  will  never  be  able  to  reduce  the  problem  to  a  completely  mechanical  process. 

Due  to  the  close  relationship  between  an  integral  and  an  antiderivative,  the  integral  sign  is  also  used  to 
mean  “antiderivative”.  You  can  tell  which  is  intended  by  whether  the  limits  of  integration  are  included: 


xz  tlx 


is  an  ordinary  integral,  also  called  a  definite  integral,  because 


r2  o3  i3 

/  x2dx=  — - —  = 

h  3  3 


it  has  a  definite  value,  namely 

7 

3' 


We  use 

J  x2  dx 

to  denote  the  antiderivative  of  x  ,  also  called  an  indefinite  integral.  So  this  is  evaluated  as 

j  x2  dx  —  —  +  C. 


It  is  customary  to  include  the  constant  C  to  indicate  that  there  are  really  an  infinite  number  of  antideriva¬ 
tives.  We  do  not  need  this  C  to  compute  definite  integrals,  but  in  other  circumstances  we  will  need  to 
remember  that  the  C  is  there,  so  it  is  best  to  get  into  the  habit  of  writing  the  C.  When  we  compute  a 
definite  integral,  we  first  find  an  antiderivative  and  then  substitute.  It  is  convenient  to  first  display  the 
antiderivative  and  then  do  the  substitution;  we  need  a  notation  indicating  that  the  substitution  is  yet  to  be 
done.  A  typical  solution  would  look  like  this: 


/  xr  dx  =  — 

h  3 


23 

y 


i3 

y 


7 

y 


The  vertical  line  with  subscript  and  superscript  is  used  to  indicate  the  operation  “substitute  and  subtract” 
that  is  needed  to  finish  the  evaluation. 

We  seem  to  have  found  a  pattern.  When  attempting  to  solve  a  previous  question,  we  found  the  an¬ 
tiderivative  of  x2  to  be  x3/3  +  c  (as  it  was  when  solving  the  indefinite  integral).  Likewise,  when  we  first 
began,  we  were  trying  to  determine  a  position  based  on  velocity,  and  3 1  gave  rise  to  3t2/2  +  k. 

As  will  be  formalized  later,  we  see  that  in  these  cases,  the  power  is  increased  to  n  +  1,  but  we  also 
divide  through  by  this  factor,  n+  1.  So  x  becomes  x2 /2,  x2  becomes  x3 /3,  and  x3  will  become  x4/4. 

Now  we  will  also  try  with  negative  and  fraction  values  in  the  following  example. 
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Solution. 


-ifdx 

xz 


X4  2x3/2 

—  H - - — 


4 


1 


/(j)4  t  2(1)3/2  rA 


415 

~6~ 


X 


Properties  of  Definite  Integrals 

1 

Some  properties  are  as  follows: 

fb 

Order  of  limits  matters:  /  f(x)  dx  = 

J  a 

=  -  [  f(x)  dx 

Jb 

If  interval  is  empty,  integral  is  zero: 

J 

f  f(x)  dx  —  0 

'a 

fb 

rb 

Constant  Multiple  Rule:  /  cf(x)  dx 

Ja 

—  c  f(x)  dx 

Ja 

rt>  j 

^b  rb 

Sum/Difference  Rule:  /  f(x)±g(x)dx=  / 

f(x)dx±  /  g(x)dx 

J  a  J c 

i  J  a 

rb 

re  rb 

Can  split  up  interval  [a,  b]  —  [a,  c]  U  [c,  b\ :  /  f(x)  dx—  f(x)  dx+  f(x)  dx 

J  a 

Ja  J  c 

rb 

rb 

The  variable  does  not  matter!:  /  f(x)dx—  /  f(t)dt 

Ja 

Ja 

The  reason  for  the  last  property  is  that  a  definite  integral  is  a  number ,  not  a  function,  so  the  variable  is 
just  a  placeholder  that  won’t  appear  in  the  final  answer. 

Some  additional  properties  are  comparison  types  of  properties. 
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Solution. 

1.  f  2f{x)  —  3g(x)  dx  —  2  /  f{x)dx  —  3  f  g(x)  dx  —  2(7)  —  3(3)  =  5. 

J  d  J Cl  J  d 

2.  [  2g(x)dx——2f  g(x)  dx  —  —  2(3)  =  —  6. 

J  b  Ja 

3-  [  f(x)-g(x)dx  =  0. 

J  d 

4.  f  f(x)dx+  f  f(x)dx—  f  f{x)dx  —  l. 

J  d  J C  Jd 

X 

We  next  evaluate  a  definite  integral  using  three  different  techniques. 
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Solution.  1.  The  shortcut  (FTC  II)  is  the  method  of  choice  as  it  is  the  fastest.  Integrating  and  using  the 
‘ top  minus  bottom ’  rule  we  have: 


x  +  1  dx  — 


x2  2 


0 


2 2  1 

ro2  i 

|y+2J 

i 

to  |  ' 

+ 

o 

1 _ 

=  4. 


2.  We  now  use  the  definition  of  a  definite  integral.  We  divide  the  interval  [0,2]  into  n  subintervals  of 
equal  width  Ax,  and  from  each  interval  choose  a  point  x*.  Using  the  formulas 


Ax 


h  a 


and 


Xj  —  a  +  /Ax, 


we  have 


2  2  i 

Ax  =  -  and  x,-  =  0  +  /Ax  =  — . 
n  n 


Then  taking  x*’s  as  right  endpoints  for  convenience  (so  that  x*  =  x,),  we  have: 

/■  2  n 

/  x+ldx  —  lim  V  /(x*)Ax 

Jo 


iim  £/ 

n — yoo 

i=  1 


2 i\  2 


ft  /  ft 


v  £  /2/  \  2 

hm  V  -  +  1- 

n — ^oo  '  Tl  In 


n—>  oo  f  , 

1 


lim  V 

n — ^oo 


4/  2 


~ ^  H — 
\  nz  n 


i  A  4/  "2 

hm  y — + y 

n—t oo  l  m2 


,i=l  ' 


i=  1 


i  4  "  .  2  "  1 

=  hm  y  i + -  y  i 

n->°°  \  n-  ”  n  “ 


=  lim 


n— \  ft 


4  n(n  +  1)  2 


2  o  +  ~n 

iZ  2  n 


n—¥  °° 


=  lim  (  2H - 1-2 


4. 
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3.  Finally,  let’s  evaluate  the  net  area  under  x  +  1  from  0  to  2. 


Thus,  the  area  is  the  sum  of  the  areas  of  a  rectangle  and  a  triangle.  Hence, 

r2 


x+  1  dx 


Net  Area 

Area  of  rectangle  +  Area  of  triangle 

(2)(l)  +  i(2)(2) 

4. 


We  next  apply  FTC  to  differentiate  a  function. 


X 


Example  6.18:  Using  FTC 

Differentiate  the  following  function: 

sW  =  J 

fX 

‘  cos(l  +5t)  sintdt. 

-2 

Solution  We  simply  apply  the  Fundamental  Theorem  of  Calculus  directly  to  get: 

g'  (x)  =  cos  ( 1  +  5x)  sinx. 


Using  the  Chain  Rule  we  can  derive  a  formula  for  some  more  complicated  problems.  We  have: 

4-  [  ]  f{t)dt  =  f{y{x))-v'[x). 

dX  J a 

Now  what  if  the  upper  limit  is  constant  and  the  lower  limit  is  a  function  of  x?  Then  we  interchange  the 
limits  and  add  a  minus  sign  to  get: 

cl  Ca  cl  /■“(-*) 

—  /  f(t)dt  —  — -  f(t)dt  =  -f(u(x))-u(x). 

dX  Ju(x)  dX  J  a 
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Combining  these  two  we  can  get  a  formula  where  both  limits  are  a  function  of  x.  We  break  up  the 
integral  as  follows: 


'  u(  x) 


rv(x) 

f(t)dt+  /  f{t) 


dt. 


We  just  need  to  make  sure  f(a )  exists  after  we  break  up  the  integral.  Then  differentiating  and  using  the 
above  two  formulas  gives: 


Many  textbooks  do  not  show  this  formula  and  instead  to  solve  these  types  of  problems  will  use  FTC  I 
along  with  the  tricks  we  used  to  derive  the  formula  above.  Either  method  is  perfectly  fine. 


Example  6.19:  FTC  I  +  Chain  Rule 

Differentiate  the  following  integral: 

J 

/■A'2 

‘  f3sin(l  +t)  dt. 

10a 

Solution.  We  will  use  the  formula  above.  We  have  f(t)  —  l3  sin(  1  +  /),  u(x')  —  lOx  and  v(x)  —  x2.  Then 
u!{ x)  —  10  and  v'(x)  =  2x.  Thus, 

1 

—  /  t3sin(l+t)dt  —  (x2)3  sin(l  +  (x2))(2x)  —  (10x)3sin(l  +  (10x))(10) 
dxJ  iox 

=  2x7  sin(  1  +  x2)  —  lOOOOx3  sin(  1  +  lOx) 


* 


Solution.  Using  the  formula  we  have: 


d  f2x 

—  1  +cos  tdt 

dx  Jr 3 


(1  +cos(2x))(2)  —  (1  +cos(x3))(3x2). 


X 
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Exercises  for  Section  6.2 


Exercise  6.2.1  Evaluate  J  t~  +  3tdt 


Exercise  6.2.2  Evaluate  /  sin  tdt 

Jo 

rio  i 

Exercise  6.2.3  Evaluate  /  -dx 

J  l  x 


Exercise  6.2.4  Evaluate  /  ex  dx 

Jo 

Exercise  6.2.5  Evaluate  /  x3  dx 

Jo 

r2 

Exercise  6.2.6  Evaluate  j  x5  dx 

Exercise  6.2.7  Find  the  derivative  of  G(x)  —  J  t2  —  3t  dt 

r  2 

Exercise  6.2.8  Find  the  derivative  of  G(x )  —  J  t  —3tdt 

rx  2 

Exercise  6.2.9  Find  the  derivative  of  G(x)  —  j  el  dt 

Exercise  6.2.10  Find  the  derivative  of  G(x)  —  J  e  dt 

Exercise  6.2.11  Find  the  derivative  of  G(x)  =  J  tan (t2)dt 

ft1  2 

Exercise  6.2.12  Find  the  derivative  of  G(x)  —  /  tan (t  )dt 

JlOx 
r-4 


Exercise  6.2.13  Suppose  J\  f(x)d. x  —  2  and  f*g(x)dx  —  7.  Find  /]4(5/(x)  +  3 g{x))dx  and  /4( 6 
2/(x))  dx. 

Exercise  6.2.14  Suppose  f52f(x)dx  —  3and  J\5  f(x)  dx  =  —2.  Find  f  \2f(X)  dx. 


Exercise  6.2.15  If  f  is  continuous  on  [a,b],  we  define  the  average  of  f(x)  on  [a,b]  to  be 

b  —  a.ja 


1  fb 

aVg[a,b\(f )  =  7—  /  /(x)  dx. 
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(a)  What  is  the  average  of  y/x  on  the  interx’al  [0, 1]? 

(b)  If  the  average  of  f(x)  on  [0,2]  and  on  [2,5]  are  6  and  4  respectively,  then  what  is  its  average  on 
[0,5]? 

6.3  Indefinite  Integrals 


In  this  section  we  focus  on  computing  indefinite  integrals.  The  process  of  finding  the  indefinite  integral  is 
called  integration  (or  integrating  f(x)). 


Solution.  Since  this  is  asking  for  the  most  general  anti-derivative  we  have: 


J  x5  +  3x  —  2  dx 


—  2 x  +  C 


where  C  is  a  constant.  X 

Common  mistakes:  One  habit  students  make  with  integrals  is  to  drop  the  dx  at  the  end  of  the  integral. 
This  is  required!  Think  of  the  integral  as  a  set  of  parenthesis.  Both  are  required  so  it  is  clear  where  the 
integrand  ends  and  what  variable  you  are  integrating  with  respect  to. 

Another  common  mistake  is  to  forget  the  +C  for  indefinite  integrals. 

Note  that  we  don’t  have  properties  to  deal  with  products  or  quotients  of  functions,  that  is, 

j  f(x)-g(x)dx^  j  f(x)  dx  j  g(x)dx. 

[  /(*)  ^r=z  /  f(*)dx 
J  g(x)  ^  f  g(x)dx 

With  derivatives,  we  had  the  product  and  quotient  rules  to  deal  with  these  cases.  For  integrals,  we  have  no 
such  rules,  but  we  will  learn  a  variety  of  different  techniques  to  deal  with  these  cases. 

The  following  integral  rules  can  be  proved  by  taking  the  derivative  of  the  functions  on  the  right  side. 
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Solution.  The  answer  is: 

f{x)  =  J  f'(x)dx  = 


where  C  is  a  constant.  X 


Example  6.23:  Indefinite  Integral 

Find  the  general  indefinite  integral  of  j 

f  3  x2  dx. 

'  +  2x  —  8  sinx)  dx 


x4dx  +  2  /  xdx—  8  /  sin  xdx 


—  +  x1  +  8  cosx  +  C, 
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Solution. 


I  3  x2  dx 


x2  dx 

+  C 


x3+C 


X 


Example  6.24:  Indefinite  Integral 

Find  the  general  indefinite  integral  of  j 

r+dx. 

sJX 

Solution. 


2 


_I 

x  2  dx 


—  9+1 


=  2 


X  2 


+  1 


+c 


—  4y(v  +  C 


X 


Example  6.25:  Indefinite  Integral 

Find  the  general  indefinite  integral  of  j 

0 

+  elx  +  xn  +  7^  dx. 

Solution. 


— b  £  -b  x  -b  7  J  dx  —  j  —  dx  -b  j  £  dx  +  j  x  dx  ~b  j  7  dx 


1  rK+1 

ill  1  7r 

=  lnx+-e/xH - - 

7  71  +  1 


~b  7x  -b  C 


Differential  Equations 


An  equation  involving  derivatives  where  we  want  to  solve  for  the  original  function  is  called  a  differential 
equation.  For  example,  f'{x)  =  2x  is  a  differential  equation  with  general  solution  f(x)  —  x2  +  C.  Some 
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solutions  (i.e.,  particular  values  of  C)  are  shown  below. 


As  seen  with  integral  curves,  we  may  have  an  infinite  family  of  solutions  satisfying  the  differential  equa¬ 
tion.  However,  if  we  were  given  a  point  (called  an  initial  value )  on  the  curve  then  we  could  determine 
f(x)  completely.  Such  a  problem  is  known  as  an  initial  value  problem. 


Example  6.26:  Initial  Value  Problem 

If  f'{x)  =  2x  and  /'(())  =  2  then  determine  f(x). 

Solution.  As  previously  stated,  we  have  a  solution  of: 

fix)  —  x2  +  C. 

But  /( 0)  =  2  implies: 

2  =  02  +  C  ->■  C  =  2. 

Therefore,  f(x)  —  x2  +  2  is  the  solution  to  the  initial  value  problem. 

* 

Exercises  for  Section  6.3 

Find  the  antiderivatives  of  the  functions: 

Exercise  6.3.1  8^/r 

Exercise  6.3.2  3 12  +  1 


Exercise  6.3.3  4/\/x 
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Exercise  6.3.4  2/z2 
Exercise  6.3.5  7s”1 
Exercise  6.3.6  (5x+l)2 
Exercise  6.3.7  (x  —  6) 2 
Exercise  6.3.8  x'^~ 
Exercise  6.3.9  — 

Xy/X 

Exercise  6.3.10  \2t  —  4| 


7.  Techniques  of  Integration 


Over  the  next  few  sections  we  examine  some  techniques  that  are  frequently  successful  when  seeking 
antiderivatives  of  functions. 


7.1  Substitution  Rule 


Needless  to  say,  most  integration  problems  we  will  encounter  will  not  be  so  simple.  That  is  to  say  we  will 
require  more  than  the  basic  integration  rules  we  have  seen.  Here’s  a  slightly  more  complicated  example: 
Find 

I  2xcos (x2)dx. 

This  is  not  a  “simple”  derivative,  but  a  little  thought  reveals  that  it  must  have  come  from  an  application 

r\ 

of  the  chain  rule.  Multiplied  on  the  “outside”  is  2x,  which  is  the  derivative  of  the  “inside”  function  x  . 
Checking: 

sin(x2)  =  cos(x2)-^-.v2  =  2xcos(x2), 


dx 


dx 


so 


1 2xcos(x2)  dx  —  sin(x2)  +  C. 


To  summarize:  If  we  suspect  that  a  given  function  is  the  derivative  of  another  via  the  chain  rule,  we 
let  u  denote  a  likely  candidate  for  the  inner  function,  then  translate  the  given  function  so  that  it  is  written 
entirely  in  terms  of  u,  with  no  x  remaining  in  the  expression.  If  we  can  integrate  this  new  function  of  u, 
then  the  antiderivative  of  the  original  function  is  obtained  by  replacing  u  by  the  equivalent  expression  in  x. 


Even  in  simple  cases  you  may  prefer  to  use  this  mechanical  procedure,  since  it  often  helps  to  avoid 
silly  mistakes.  For  example,  consider  again  this  simple  problem: 


j  2x  cos  (x2)dx. 


Let  u  =  x2,  then  du/dx 
replace  it  by  du: 


=  2x  or  du  =  2 xdx.  Since  we  have  exactly  2xdx  in  the  original  integral,  we  can 


2xcos(x2) dx—  /  cos udu  —  sin u  +  C  =  sin(x2)  +C. 
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This  is  not  the  only  way  to  do  the  algebra,  and  typically  there  are  many  paths  to  the  correct  answer. 
Another  possibility,  for  example,  is:  Since  du/dx  —  2x,  dx  —  du/2x,  and  then  the  integral  becomes 

J  2xcos  (x* 1 2 3 4 5 6)dx  —  j  2xcos  u—  —  J  cos  udu. 

The  important  thing  to  remember  is  that  you  must  eliminate  all  instances  of  the  original  variable  x. 


Example  7.2:  Substitution  Rule 


Solution.  We  let  u  =  ax  +  b  so  du  —  adx  or  dx  =  du/a.  Then 

I  ( ax  +  b)ndx —  [ -undu=— - —  un+l  +C  —  — - ~(ax  +  b)n+l  +  C. 

J  J  a  a(n+ 1)  a{n+ 1) 


Example  7.3:  Substitution  Rule 


Evaluate  /  sin(ax  +  b)  dx,  assuming  that  a  and  b  are  constants  and  a  /  0. 


Solution  Again  we  let  u  —  ax  +  b  so  du  =  adx  or  dx  —  du/a.  Then 

r  r  11  1 

/  sin (ax  +  b)dx—  /  - sinudu  =  cosw)  +C  = — cos (ax  +  b)+C. 
(/  (/  ^ 


Strategy  for  Substitution  Rule 


A  general  strategy  to  follow  is: 

1.  Choose  a  possible  u  =  u(x).  Tip:  Choose  a  substitution  u  so  that  its  derivate  also  appears  in 
the  integral  (up  to  a  constant). 

2.  Calculate  du  =  u'{x)  dx. 

du 

3.  Either  replace  u  (x)  dx  by  du,  or  replace  dx  by  — — ,  and  cancel. 

u'{x) 

4.  Write  the  rest  of  the  integrand  in  terms  of  u.  If  this  is  not  possible,  the  substitution  will  not 
work:  You  must  go  back  to  step  1. 

5.  Find  the  indefinite  integral.  (Again,  if  this  is  not  possible,  try  a  different  substitution,  or  a 
different  method). 

6.  Rewrite  the  result  in  terms  of  x. 
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Example  7.4:  Substitution 

Evaluate  the  following  integral:  j 

f  2X  Jr 

\J 1  —  4x2 

Solution  We  try  the  substitution: 
Then, 


u  —  1  —  4xr. 


du  —  —8.x:  dx 

In  the  numerator  we  have  2x  dx,  so  rewriting  the  differential  gives: 


Then  the  integral  is: 


-du  =  2x  dx. 
4 


I  .  dx  —  f  (l  —  4x2)  (2 x  dx) 

J  v  1  —  4x2  J 


—  /  u 


^ ~ \^u 


-1  \  «1/2  „ 
4  )  1/2  +C 


V 1  —  4.x'2 


+  C 


X 


Example  7.5:  Substitution 

Evaluate  the  following  integral:  j 

f  cosx(sinx)5<iz. 

Solution  In  this  question  we  will  let  u  —  sin.v.  Then, 

du  —  cos.x:  dx. 

Thus,  the  integral  becomes: 

j  cos x(sinx)5  dx  —  j  u5  du 


=  6+C 


sinx 


+  C 


6 
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X 


Example  7.6:  Substitution 

Evaluate  the  following  integral:  j 

fcos(^) 

. —  C.Lyv  • 

yJX 

Solution,  We  use  the  substitution: 
Then, 

Rewriting  the  differential  we  get: 

The  integral  becomes: 


»  =  W2. 


du  =  -x  x^2dx. 
2 


2  du  —  —=  dx. 
\Jx 


cos(i/r) 


dx  —  2  cos  udu 


—  2  sin  u  +  C 


—  2  sin  (i/t)+C 


X 


Example  7.7:  Substitution 

Evaluate  the  following  integral:  j 

f  2x3  \J  x1  +  1  dx. 

Solution.  This  problem  is  a  little  bit  different  than  the  previous  ones.  It  makes  sense  to  let: 

u  —  x2  +  1 , 


then 

Making  this  substitution  gives: 


du  —  2x  dx. 


j  2x3  \A2  +  1  dx 


j  x2  \/ x2  +  1  (2x)  dx 
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This  is  a  problem  because  our  integrals  can’t  have  a  mixture  of  two  variables  in  them.  Usually  this  means 
we  chose  our  u  incorrectly.  However,  in  this  case  we  can  eliminate  the  remaining  x’s  from  our  integral  by 
using: 


We  get: 


u  —  x  -f- 1  — ^  X  —  u  —  1 . 


x2ul^2du  —  \{u-Xyt2d, 


—  /  ll 


2/2-ul'2du 


=  5<5/2U«3/2+c 


=  !(^+l)V2-2(^+ 1)3/2  +  c 


The  next  example  shows  how  to  use  the  Substitution  Rule  when  dealing  with  definite  integrals. 


Example  7.8:  Substitution  Rule 


Solution  First  we  compute  the  antiderivative,  then  evaluate  the  integral.  Let  u  =  x  ,  so  du  =  2 xdx  or 
xdx  =  du/2.  Then 

I  xsin(x2)  dx  —  J  ^  sin  a  du  =  ^(—  cos  a)  +C  =  —  ^cos(x2)  +  C. 


Now 


/4  9  1  ? 
J  xsin(x“) dx  —  —  -cos(x' ) 


-^cos(16)  +  ^cos(4). 


A  somewhat  neater  alternative  to  this  method  is  to  change  the  original  limits  to  match  the  variable  u.  Since 
u  =  x  ,  when  x  —  2,  u  —  4,  and  when  x  —  4,  u  —  16.  So  we  can  do  this: 


r  4  /■  1 6  ]  l 

/  xsin (x2Wx=  /  -sin udu— — (cos u) 

h  J\  2  2y  J 


16 


-^cos(16)  +  ^cos(4). 


An  incorrect,  and  dangerous,  alternative  is  something  like  this: 


=  —  ^-cos(x2) 
2  v  ’ 


=  -^cos(16)  +  ^cos(4). 


r  4  M  l  l 

/  xsin(x2) dx  =  /  - sinwr/M  =  —  -cos(m) 

.72  J2  2  2 

f4  1 

This  is  incorrect  because  /  -  sin  udu  means  that  u  takes  on  values  between  2  and  4,  which  is  wrong.  It 

Ji  2 


is  dangerous,  because  it  is  very  easy  to  get  to  the  point  —-cos (u) 


and  forget  to  substitute  x~  back  in  for 
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u,  thus  getting  the  incorrect  answer  —  ^  cos(4)  +  ^  cos(2).  An  acceptable  alternative  is  something  like: 


xsin(x2)  dx  — 


rx=4  i  ] 

-sin  udu—  — cos(w) 

x=2  2  2 


x=4 


x=2 


—  —~COs(x2) 
2  v  ’ 


cos(16)  cos(4) 


To  summarize,  we  have  the  following. 


Solution.  Let  u  —  sin(^t)  so  du  =  7rcos(;r t)dt  or  du/n  —  cos (nt)dt.  We  change  the  limits  to  sin(7r/4)  = 
V2/2  and  sin(7r/2)  =  1.  Then 


Z'1/2  cos (nt) 
’1/4  sin2(;rz) 


dt  — 


- ;xdu  = 


W 2/2  71  U- 


f1  1  -2j  1  11  1 

/  —  u  du  — - 

ly/2/2  71  71—1 


1  =_I  +  ^ 

V2/2  71  71 


X 


Exercises  for  Section  7.1 

Find  the  following  indefinite  and  definite  integrals. 
Exercise  7.1.1  ^(1—  t)9 dt 

Exercise  7.1.2  ^(x2  +  l)2Jx 

Exercise  7.1.3  f  x(x2  +  l)100  dx 
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Exercise  7.1.4 

Exercise  7.1.5 

Exercise  7.1.6 

Exercise  7.1.7 

Exercise  7.1.8 

Exercise  7.1.9 

Exercise  7.1.10 

Exercise  7.1.11 

Exercise  7.1.12 

Exercise  7.1.13 

Exercise  7.1.14 

Exercise  7.1.15 

Exercise  7.1.16 

Exercise  7.1.17 

Exercise  7.1.18 

Exercise  7.1.19 

Exercise  7.1.20 


1 


s/l-5t 


dt 


sirrxcos  xdx 

x\J  100  —  x2dx 
x2 

dx 

cos(7T/)cos(sin(7r/))  dt 


sin* 

cos3* 


dx 


tan  xdx 

K 

sin5  (3*)  cos  (3  x)dx 

sec2*tan  xdx 
rV7i/2 

/  *sec (*' )tan(*r )dx 
Jo 

[  sin  (tan*) 


cos2* 


4  1 


3  (3*-7)2 

k/ 6 


dx 


dx 


(cos2*  —  sin2*)  dx 


6* 


(*2  —  7)1/9 


dx 


(2*3  —  1 )  (*4  —  2*) 6  dx 


sin  xdx 


l 


I  f{x)f'{x)dx 
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7.2  Powers  of  Trigonometric  Functions 


Functions  consisting  of  powers  of  the  sine  and  cosine  can  be  integrated  by  using  substitution  and  trigono¬ 
metric  identities.  These  can  sometimes  be  tedious,  but  the  technique  is  straightforward.  A  similar  tech¬ 
nique  is  applicable  to  powers  of  secant  and  tangent  (and  also  cosecant  and  cotangent,  not  discussed  here). 

The  trigonometric  substitutions  we  will  focus  on  in  this  section  are  summarized  in  the  table  below: 


Substitution 

u  =  sinx 

u  —  cosx 

u  =  tanx 

u  =  secx 

Derivative 

du  —  cos  xdx 

du  —  —  sin  xdx 

du  —  sec2  xdx 

du  —  secxtanx  dx 

An  example  will  suffice  to  explain  the  approach. 


Solution.  Rewrite  the  function: 


sin 5  xdx  = 


sin  x  sin4  xdx 
sinx(sin2x)2dx 
sinx(l  —  cos  2x)2dx. 


Now  use  u  =  cosx,  du  —  —  sin  xdx: 


sinxfl  —  cos  2x)2dx 


-(1  —  u2)2  du 
-(1  —  2 u2  +  u4)  du 


'  1  +  2  u2  —  u4  du 
2  ,  l  c 

-  ii  T  —u  —  —u  C 

2,1c 

-  cosx  -|-  -  cos  x—  -  cos  x  +  C. 


Observe  that  by  taking  the  substitution  u  —  cosx  in  the  last  example,  we  ended  up  with  an  even  power 
of  sine  from  which  we  can  use  the  formula  sin2x  +  cos2x  =  1  to  replace  any  remaining  sines.  We  then 
ended  up  with  a  polynomial  in  u  in  which  we  could  expand  and  integrate  quite  easily. 

This  technique  works  for  products  of  powers  of  sine  and  cosine.  We  summarize  it  below. 
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Products  of  Sine  and  Cosine 


When  evaluating  J  sin'” * cos'3  xdx: 

1.  The  power  of  sine  is  odd  (m  odd): 

(a)  Use  u  —  cos*  and  du  —  —  sin  xdx. 

(b)  Replace  dx  using  (a),  thus  cancelling  one  power  of  sin*  by  the  substitution  of  du,  and  be 
left  with  an  even  number  of  sine  powers. 

(c)  Use  sin2*  =  1  —  cos2*  (=1-1  u2)  to  replace  the  leftover  sines. 

2.  The  power  of  cosine  is  odd  ( n  odd): 

(a)  Use  u  —  sin*  and  du  —  cos  xdx. 

(b)  Replace  dx  using  (a),  thus  cancelling  one  power  of  cos*  by  the  substitution  of  du,  and  be 
left  with  an  even  number  of  cosine  powers. 

(c)  Use  cos2*  =  1  —  sin2*  (=  1  —  u2)  to  replace  the  leftover  cosines. 

3.  Both  m  and  n  are  odd: 

Use  either  1  or  2  (both  will  work). 

4.  Both  m  and  n  are  even: 

Use  cos2*  =  j  (1  +cos(2*))  and/or  sin2*  =  \  (1  —  cos(2*))  to  reduce  to  a  form  that  can  be 
integrated. 

NOTE:  As  m  and  n  get  large,  multiple  steps  will  be  needed. 


Example  7.12:  Odd  Power  of  Cosine  and  Even  Power  of  Sine 


Evaluate  J  sin6*cos 5 xdx. 


Solution  We  will  show  this  solution  in  two  ways.  First,  we  show  the  solution  by  rewriting  the  function  to 
make  the  substitution  more  clear.  Then,  we  show  the  solution  using  the  above  method. 

Solution  1:  Rewrite  the  function  as  follows. 


J  sin6*cos5*d*  =  J 


sin6  *  cos4  *  cos  *  dx 


j  sin6*(cos2*)2  cos  xdx 
j  sin6*(l  —  sin2*)2  cos* dx 


Then  use  the  substitution  u  =  sin*  and  du  —  cos  xdx,  that  is,  dx 


du 


cos* 


j  sin6x(l  —  sin2*)2  cos  *J*  =  J  w6(l  —  u2)2du 


=  u6  —  2  us  +  w10  du 
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u1  2  u9  un 
~7  9~  U 

sin  x  2sin  x 
~1  9~ 


+  C 


+ 


sinnx 

11 


+  C 


Solution  2:  Since  the  power  of  cosine  is  odd,  we  use  the  substitution  u 


is. 


Solution  2: 

dx 

du 

— 

cosx 

Then  /  s  in6  x  cos5  xdx  is  equal  to: 


sinx  and  du  —  cos xdx,  that 


=  /  it6  cos5  X 


du 


cosx 


=  j  u6  { cos2x)  du 
—  I  u6(l  —  sin2  x)2du 


—  /  u6( 1  —  u2)2 du 


—  /  n6  —  2k8  +  uw  du 


li 


u1  2  u9  u 

~7~  ~V  +  Jl+C 

sin7x  2sin9x  sinnx 
~1  9  +  11  +C 


Using  the  substitution 
Canceling  a  cosx  and  rewriting  cos4x 
Using  trig  identity  cos2x  =  1  —  sin2x 
Writing  integral  in  terms  of  n’s 
Expand  and  collect  like  terms 

Integrating 

Replacing  u  back  in  terms  of  x 


X 


Solution.  Since  the  power  of  cosine  is  odd,  we  use  the  substitution  u  =  sinx  and  du  —  cos  xdx.  This  may 
seem  strange  at  first  since  we  don’t  have  sinx  in  the  question,  but  it  does  work! 


cos3  x 


dx 


cos3x 


du 


cosx 


cos 1  xdu 


1  —  sirnx')  du 


1  —  u)  du 


«  _ 
u  — -+C 

•  3 

sim  x 

sinx - - - bC 


Using  the  substitution 
Canceling  a  cosx 

Using  trig  identity  cos2x  =  1  —  sin2x 
Writing  integral  in  terms  of  u's 

Integrating 

Replacing  u  back  in  terms  of  x 


3 
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Example  7.14:  Product  of  Even  Powers  of  Sine  and  Cosine 


Evaluate  /  sin2  x  cos2  xdx. 


Solution.  Use  the  formulas  sin2x  =  (1  — cos(2x))/2  and  cos2x  =  (1  +  cos(2x))/2  to  get: 

f  .  9  7  ,  f  1  — cos(2x)  l+cos(2x)  , 

/  snrxcos  xdx  —  /  - - - - - dx. 


We  then  have 


'  .  2  2  ,  Cl  —  cos(2x)  l+cos(2.r)  , 

sin“7ccos  xdx  —  /  - - - - - dx 

J  2  2 


—  -  1— cos/2  xdx 
4  J 


co^lxdx 


=  Vx- 


=  ^(x—^Jl+cos4xdx 


If  If  sin4x 
=  4  [X-2  (X+  — 


If  x  sin4x\  _ 
=  4  *-2—r  l+C 


Example  7.15:  Even  Power  of  Sine 


Evaluate  /  sin6 xdx. 


Solution.  Use  sin2x  =  (1  —  cos(2x))/2  to  rewrite  the  function: 

j  sin6 xdx  —  j{sivfx)* idx 
f  (1  — cos2x)3 


8 


dx 


1 


—  -  J  1  —  3cos2x  +  3cos22x  — cos32 xdx. 
Now  we  have  four  integrals  to  evaluate.  Ignoring  the  constant  for  now: 


1  dx  —  x 
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and 


-  3  cos  2 xdx  —  —  -  sin  2x 
2 


are  easy.  The  cos3  2x  integral  is  like  the  previous  example: 


—  cos3  2  xdx 


I  —  cos  2xcos2  2  xdx 
/  —  cos  2x(  1  —  sin2  2x)  dx 


-(1  —  u2)du 
2 v  ’ 


1  (  w3 

—  -  w  — x- 

2  V  3 


1  /  .  _  sin32x 

—  sin2x - 

2  V  3 


And  finally  we  use  another  trigonometric  identity,  cos2x  =  ( 1  +  cos(2x))/2: 

J 3  cos2  2xdx  —  3 

So  at  long  last  we  get 


l+cos4x  ,  3  f  sin4x 

- - - r/x  =  -  xH - - — 

2  2  V  4 


•  6  ,  x  3  .  _  1  f  .  n  sin32x\  3  (  sin4x\ 

sin  xdx  =  -  —  —  sm2x  —  —  I  sin2x - - —  |  +  —  x 4 - - —  +  C. 


8  16 


16 


16 


Next,  we  turn  our  attention  to  products  of  secant  and  tangent.  Some  we  already  know  how  to  do. 


j  sec~ xdx  =  tanx  +  C 
We  can  also  integrate  tanx  quite  easily. 


seextanxr/x  =  secx  +  C 


Solution.  Note  that  tanx 


sinx 

cosx 


and  let  u 


cosx,  so  that  du  —  —  sin  xdx. 


j  tanxr/x 


-  In  Ini  T  C 


—  In  |  cosx|  +C 
In  I  secxl  T  C 


Rewriting  tanx 
Using  the  substitution 

Cancelling  and  pulling  the  —  1  out 

U sing  formula  [  -  dx  —  In  |  u  \  +  C 
J  u 

Replacing  u  back  in  terms  of  x 
Using  log  properties  and  secx  =  1/  cosx 
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X 

Let’s  take  a  moment  to  realize  this  result!  A  common  mistake  is  to  believe  that  /  tan xdx  is  sec2(x)  +  C 
-  this  is  not  true. 


Solution.  Note  that  tan2x  =  sec2x  —  1. 


tan2 xdx  —  /sec2x— 1  dx  Rewriting  tan x 


—  tanx  — x  +  C  Since  /  sec.rdx  =  tanx  +  C 


In  problems  with  tangent  and  secant,  two  integrals  come  up  frequently: 


sec3  xdx  and 


sec  xdx. 


Both  have  relatively  nice  expressions  but  they  are  a  bit  tricky  to  discover. 
First  we  do  j  sec  xdx,  which  we  will  need  to  compute  J  see3  xdx. 


X 


Solution. 


sec  xdx  = 


seex  +  tanx  , 
secx - dx 


seex  +  tanx 
sec2x  +  secxtanx 

seex  +  tanx 


dx. 


Now  let  u  =  seex  +  tanx,  du  —  secxtanx  +  sec 2 xdx,  exactly  the  numerator  of  the  function  we  are  integrat¬ 
ing.  Thus 


J  sec  xdx 


seex  +  seextanx 
seex  +  tanx 


f  -  du  —  lnlwl  +C 
./  u 

In  I  secx  +  tanxl  +C. 


dx 


X 
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Now  we  compute  the  integral  J  sec 3  xdx. 


Example  7.19:  Integral  of  Secant  Cubed 


Solution. 


'l  sec3x  sec3x  sec3x  (tan2x+ 1)  secx 

sec3x  =  — - - 1 - - —  =  — - - b  - - - - 


sec3x  sec  x  tan2  x  secx 
~  +  2  +^“ 

+ 


sec3  x  +  secx  tan2  x  secx 


We  already  know  how  to  integrate  secx,  so  we  just  need  the  first  quotient.  This  is  “simply”  a  matter  of 
recognizing  the  product  rule  in  action: 

/  sec3  x  +  secx  tan2  xdx  —  secxtanx. 


So  putting  these  together  we  get 


^  ,  secxtanx  In  I  secx  +  tanxl  ^ 

sec  xrfx  = - - - 1 - - - bC, 


For  products  of  secant  and  tangent  it  is  best  to  use  the  following  guidelines. 
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Products  of  Secant  and  Tangent 


When  evaluating  j  sec'"xtan"xcfx: 

1.  The  power  of  secant  is  even  (m  even): 

(a)  Use  u  —  tanx  and  du  =  sec2  xdx. 

(b)  Cancel  sec2  x  by  the  substitution  of  dx,  and  be  left  with  an  even  number  of  secants. 

(c)  Use  sec2x  =  1  +  tan2x  (=  1  +  u2)  to  replace  the  leftover  secants. 

2.  The  power  of  tangent  is  odd  ( n  odd): 

(a)  Use  u  —  secx  and  du  =  secxtan xdx. 

(b)  Cancel  one  secx  and  one  tanx  by  the  substitution  of  dx. 

The  number  of  remaining  tangents  is  even. 

(c)  Use  tan2x  =  sec2x—  1  (=  ir  —  1)  to  replace  the  leftover  tangents. 

3.  m  is  even  or  n  is  odd: 

Use  either  1  or  2  (both  will  work). 

4.  The  power  of  secant  is  odd  and  the  power  of  tangent  is  even: 

No  guidelines.  Remember  that  /  sec  xdx  and  /  sec3  xdx  can  usually  be  looked  up. 


Example  7.20:  Even  Power  of  Secant 


Evaluate  /  sec6xtan6xJx. 


Solution.  Since  the  power  of  secant  is  even,  we  ue  u  =  tanx,  so  that  du  —  sec ~xdx. 


sec6  x  tan 6  xdx  — 


sec 6 x(u6) 


du 

sec2x 


Using  the  substitution 
Cancelling  a  sec2x 


sec4x(w6)  du 
( sec2 x)2(u6)du  Rewriting  sec4 x 

(1+tan 2x)2(u6)du  Using  sec2x=  l+tan2x 
1  +  u2 ) 2  ( u6 )  du  U sing  the  substitution 
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To  integrate  this  product  the  easiest  method  is  expand  it  into  a  polynomial  and  integrate  term-by-term. 


sec6xtan6xdx 


(u6  +  2n8  +  n10)  du 


,11 


u1  2  U9  M‘ 

=  T  +  “9“  +  TT+c 


tan7x  2tan9x 
— - - 1 - - —  + 


tan1 1 x 
11 


Expanding 

Integrating 

+  C  Rewriting  in  terms  of  x 


Example  7.21:  Odd  Power  of  Tangent 


Evaluate  j  sec5xtan xdx. 


Solution.  Since  the  power  of  tangent  is  odd,  we  use  u  —  secx,  so  that  du  =  secxtanxdx.  Then  we  have: 
f  sec5xtanxd.Y  =  Jsec5xonx 


du 


secxtanx 


=  J  sec^xdu 

—  /  u4  du 

u5 

=  T+c 


sec5x 


+  C 


Substituting  dx  first 
Cancelling 

Using  the  substitution 
Integrating 

Rewriting  in  terms  of  x 


Example  7.22:  Odd  Power  of  Secant  and  Even  Power  of  Tangent 


Solution  The  guidelines  don’t  help  us  in  this  scenario.  However,  since  tan2x  =  sec2x—  1,  we  have 


J  secxtan2xdx  =  ^secx(sec2x— 1  )dx 


f  sec3  x  — sec  xWx 


-  (secxtanx  +  In  |  secx  +  tanx|)  —  In  |  secx  +  tanx|  +C 

^  secxtanx  +  ^  In  |  secx  +  tanx|  —  In  |  secx  +  tanx|  +  C 

1  1  .  ^ 

-  secxtanx - In  secx  +  tanx  +  C 

2  2  1  1 


7.2.  Powers  of  Trigonometric  Functions  269 


* 

Exercises  for  7.2 


Find  the  antiderivatives. 


Exercise  7.2.1  j 

'  sin 2  xdx 

Exercise  7.2.2  j 

f  sin3.r  dx 

Exercise  7.2.3  j 

f  sin4.r  dx 

Exercise  7.2.4  j 

f  cos2  v  sin3  v  dx 

Exercise  7.2.5  j 

f  cos  3  xdx 

Exercise  7.2.6  j 

f  cos3  v  sin 2  xdx 

Exercise  7.2.7  j 

f  sinv(cos  x)3^2dx 

Exercise  7.2.8  j 

f  sec2  v  esc 2  xdx 

Exercise  7.2.9  j 

f  tan3xsecxJx 

Exercise  7.2.10 

/  (  1  +  '  )  dx 

J  \cscx  secx J 

Exercise  7.2.11 

f  cos2x  +  cosx+  1  , 

/  - 5 - dx 

J  cos^x 

Exercise  7.2.12 

/  x  sec2  (x2 )  tan4  (x2 )  dx 
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7.3  Trigonometric  Substitutions 


So  far  we  have  seen  that  it  sometimes  helps  to  replace  a  subexpression  of  a  function  by  a  single  variable. 
Occasionally  it  can  help  to  replace  the  original  variable  by  something  more  complicated.  This  seems  like 
a  “reverse”  substitution,  but  it  is  really  no  different  in  principle  than  ordinary  substitution. 


Solution.  Let  x  =  sinzz  so  dx  =  cos  udu.  Then 


1  —  x2  dx  — 


1  —  sirrucosudu  —  J  V cos2zzcoszzz/zz. 


We  would  like  to  replace  V cos2  zz  by  cos  zz,  but  this  is  valid  only  if  cos  zz  is  positive,  since  V cos2  zz  is 
positive.  Consider  again  the  substitution  x  =  sinzz.  We  could  just  as  well  think  of  this  as  u  —  arcsin.r.  If 
we  do,  then  by  the  definition  of  the  arcsine,  —  7t/2  <u<n/ 2,  so  coszz  >  0.  Then  we  continue: 


ucosudu 


J  cos2 udu 

f  1  +  cos2zz  zz  sin2  zz 

J  2  2  4 

arcsin.r  sin(2arcsinx) 

- x - + - - - ; - ~+C. 


+  C 


This  is  a  perfectly  good  answer,  though  the  term  sin(2arcsinx)  is  a  bit  unpleasant.  It  is  possible  to  simplify 
this.  Using  the  identity  sin2x  =  2  sinxcosx,  we  can  write  sin2zz  =  2  sinzz  cos  u  —  2  sin(arcsin.r)  \J  1  —  sin2  zz  = 
2xJ\  —  sin2 (arcsin.r)  =  2.ry/ 1  —  .r2.  Then  the  full  antiderivative  is 


arc sin  x 
2 


+ 


2x\/ 1  —  x2 

4 


arcsinx 

2 


+  C. 


X 


This  type  of  substitution  is  usually  indicated  when  the  function  you  wish  to  integrate  contains  a  poly- 

9  9 

nomial  expression  that  might  allow  you  to  use  the  fundamental  identity  sin  x  +  cos  x  =  1  in  one  of  three 
forms: 

cos2x  =  1  —  sin2x  sec2x  =  1  +tan2x  tan2x  =  sec2x— 1. 

9  9 

If  your  function  contains  1  —  x  ,  as  in  the  example  above,  try  x  =  sinzz;  if  it  contains  1  +x  try  x  =  tanzz; 
and  if  it  contains  x  —  1,  try  x  =  seczz.  Sometimes  you  will  need  to  try  something  a  bit  different  to  handle 
constants  other  than  one  which  we  will  describe  below.  First  we  discuss  inverse  substitutions. 

In  a  traditional  substitution  we  let  zz  =  zz(x),  i.e.,  our  new  variable  is  defined  in  terms  of  x.  In  an 
inverse  substitution  we  let  x  =  g(u),  i.e.,  we  assume  x  can  be  written  in  terms  of  zz.  We  cannot  do  this 
arbitrarily  since  we  do  NOT  get  to  “choose”  x.  For  example,  an  inverse  substitution  of  x  =  1  will  give  an 
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obviously  wrong  answer.  However,  when  x  =  g(u)  is  an  invertible  function,  then  we  are  really  doing  a 
^-substitution  with  u  =  g~  1  (x).  Now  the  substitution  rule  applies. 

Sometimes  with  inverse  substitutions  involving  trig  functions  we  use  0  instead  of  u.  Thus,  we  would 
take  x  —  sin  9  instead  of  x  —  sin  u.  However,  as  we  discussed  above,  we  would  like  our  inverse  substitution 
x  =  g{u)  to  be  a  one-to-one  function,  and  x  —  sinw  is  not  one-to-one.  We  can  overcome  this  issue  by  using 
the  restricted  trigonometric  functions.  The  three  common  trigonometric  substitutions  are  the  restricted 
sine,  restricted  tangent  and  restricted  secant.  Thus,  for  sine  we  use  the  domain  [—ft/ 2,  ft/ 2]  and  for 
tangent  we  use  {—'ll/ 2,  n/2).  Depending  on  the  convention  chosen,  the  restricted  secant  function  is 
usually  defined  in  one  of  two  ways. 


One  convention  is  to  restrict  secant  to  the  region  [0,  n/2)  U  {ft/ 2,  k\  as  shown  in  the  middle  graph.  The 
other  convention  is  to  use  [0,  ft/ 2)  U  [ft,  3ft /2)  as  shown  in  the  right  graph.  Both  choices  give  a  one-to- 
one  restricted  secant  function  and  no  universal  convention  has  been  adopted.  To  make  the  analysis  in  this 
section  less  cumbersome,  we  will  use  the  domain  [0,  ft/ 2)  U  [ft,  3ft / 2)  for  the  restricted  secant  function. 
Then  sec_1x  is  defined  to  be  the  inverse  of  this  restricted  secant  function. 

Typically  trigonometric  substitions  are  used  for  problems  that  involve  radical  expressions.  The  table 
below  outlines  when  each  substitution  is  typically  used  along  with  their  intervals  of  validity. 


Expression 

Substitution 

Validity 

V a2  —x2 

x  =  a  sin  9 

9  e  [-ft/2,  ft/2] 

V a2  +x2  or  a2  +x2 

x  —  aidx\9 

9  e  {-ft/2,  ft/2 ) 

Vx2  —  a2 

x  —  a  sec  9 

9  G  [0,  ft/2)  U  [ft,  3ft /2) 

All  three  substitutions  are  one-to-one  on  the  listed  intervals.  When  dealing  with  radicals  we  often  end  up 
with  absolute  values  since 


z 


For  each  of  the  three  trigonometric  substitions  above  we  will  verify  that  we  can  ignore  the  absolute  value 
in  each  case  when  encountering  a  radical. 

For  x  —  a  sin  0,  the  expression  V a2  —x2  becomes 


V 


a 2  sin2  9 


sin2  9)  —  aV cos2  9 


a\cos9 


a  cos  9 
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This  is  because  cos  6  >  0  when  0  G  [—n/2,  n/2],  Forx  =  «tan0,  the  expression  V a2  +  x 2  becomes 

\J a2  +  x2  —  \J a 2  +  a 2  tan2  0  —  \J a2(  1  +  tan2  0)  —  aV sec2  0  —  a\ sec0|  —  asecO 

This  is  because  sec  0  >  0  when  0  G  (—n/2,  n/2). 

Finally,  for  x  —  a  sec  0,  the  expression  Vx2  —  a 2  becomes 

\J x2  —  a2  —  \J  a 2  sec2  0  —  a2  —  \J  a2(  sec2  0  —  1)  =  aV  tan2  0  —  a|tan0|  =  atan0 

This  is  because  tan0  >  0  when  0  G  [0,  n/2)  U  [n,  ?>n/2). 

Thus,  when  using  an  appropriate  trigonometric  substitution  we  can  usually  ignore  the  absolute  value. 
After  integrating,  we  typically  get  an  answer  in  terms  of  0  (or  u)  and  need  to  convert  back  to  x’s.  To  do 
so,  we  use  the  two  guidelines  below: 

•  For  trig  functions  containing  0,  use  a  triangle  to  convert  to  x’s. 

•  For  0  by  itself,  use  the  inverse  trig  function. 

All  pieces  needed  for  such  a  trigonometric  substitution  can  be  summarized  as  follows: 


Expression 

Substitution 

Differential 

Identity 

Inverse  of 
Substitution 

x  —  a  sin  0 

dx  =  ac os  Odd 

e=sin->(|) 

V  a2  —  x2 

Va2  —x2  =  a  cos  0 

y/  a2  +  x2 

or 

a 2  +  x2 

x  =  a  tan  0 

dx  —  a  sec2  OdO 

V a2  +  x2  —  a  sec  0 

«=tan-‘(|) 

x  —  a  sec  0 

dx  —  <3sec0tan0£/0 

0  = sec'1 (|) 

V x2  —  a 2 

Vx2  —  a2  —  <7tan0 

To  emphasize  the  technique,  we  redo  the  computation  for 


1  —  x2  dx. 


Solution.  Since  x/ 1  —  x2  appears  in  the  integrand  we  try  the  trigonometric  substitution  x  =  sin  0.  (Here  we 
are  using  the  restricted  sine  function  with  0  G  [— 7t/2,  n/2]  but  typically  omit  this  detail  when  writing  out 
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the  solution.)  Then  dx  —  cos  OdO  . 


\/l  —  x2  dx  —  I  \ZT  —  sin2  0 1  cos  0  d0~\  Using  our  (inverse)  substitution 
=  j  V i cos2  0  cos  0  dQ  Since  sin2  0  +  cos2  0  =  1 

=  /  I  cos  01  ■  cos  0d6  Since  V cos2  0  —  I  cos  0 1 


=  /  cos 1  OdO 


Since  for  9  E  [—  f ,  f  ]  we  have  cos  0  >0. 


Often  we  omit  the  step  containing  the  absolute  value  by  our  discussion  above.  Now,  to  integrate  a  power 
of  cosine  we  use  the  guidelines  for  products  of  sine  and  cosine  and  make  use  of  the  identity 

n.  1 

cos2  9  —  -(1  +cos(20)). 


Our  integral  then  becomes 

1  —  x2dx  —  ^  j (l  +cos(2 0))d0  —  ^  +  s^n(-^) 

To  write  the  answer  back  in  terms  of  .r  we  use  a  right  triangle.  Since  sin  0  —  x/\  we  have  the  triangle: 


The  triangle  gives  sin  0,  cos  0,  tan 0,  but  have  a  sin(20).  Thus,  we  use  an  identity  to  write 


sin(20)  =  2 sin  0  cos  0  =  2  (  y  j  |  ^ 


For  0  by  itself  we  use  0  =  sin  1  x.  Thus,  the  integral  is 


- r  ,  sin  lx  x\J  1  —  x2 

1  -x~dx  =  — - - 1 - - - hC 


Example  7.25:  Secant  Subsitution 


^  ,  f  V25x2  -  4  , 
Evaluate  /  - dx. 
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Solution  We  do  not  have  Vx2  —  a2  because  of  the  25,  but  if  we  factor  25  out  we  get: 


n/25(*2  —  (4/25)) 


dx  — 


Now,  a  =  2/5,  so  letx  =  |sec0.  Alternatively,  we  can  think  of  the  integral  as  being: 


dx 


Then  we  could  let  u  —  5x  followed  by  u  —  2  sec  0,  etc.  Or  equivalently,  we  can  avoid  a  ^-substitution  by 
letting  5x  —  2  sec  0.  In  either  case  we  are  using  the  trigonometric  substitution  x  =  |  sec  0,  but  do  use  the 


method  that  makes  the  most  sense  to  you!  As  x  —  |  sec  0  we  have  dx 


\  sec  0  tan  0  <70 


\/25x2 


dx 


25- 


25 


—  4 


|  sec  0 


|  sec  0  tan  0  <70 


'4(sec2  0  —  1)  •  tan  0  <70 
=  2  I  V tan2  0  ■  tan  0  <7  0 
—  ^  j tan2  ®  ^  ^ 

=  2  I  (sec2  0  —  1)  <70 
=  2(tan0  —  0)  +C 


Using  the  substitution 
Cancelling 

Using  tan2  0  +  1  =  sec2  0 
Simplifying 

Using  tan2  0  +  1  =  sec2  0 
Since  /  sec2  0  <70  =  tan  0  +  C 


For  tan  0,  we  use  a  right  triangle. 

2 

x  =  -  sec  0  — > 

Using  SOH  CAH  TOA,  the  triangle  is  then 


2  1 
5  cos0 


-A 


COS0 


2 

5x 


For  0  by  itself,  we  use  0  —  sec  1  (5x/2).  Thus, 


V  25x2  —  4 


dx—2 


V  25x2  —  4 


x 


5x 

sec  *|y 


+  C 


In  the  context  of  the  previous  example,  some  resources  give  alternate  guidelines  when  choosing  a 
trigonometric  substitution. 
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V  a2  —  b2x2  — >  x—ysin6 

b 


\J b2x2  +  a 2  or  (b2x2+a2) 


— >  x  —  -  tan  0 

b 


V b2x2  —  a 2  — >  x  —  y  sec  6 

b 


We  next  look  at  a  tangent  substitution. 


Example  7.26:  Tangent  Substitution 


Solution.  Let  x  =  5 tan  0  so  that  dx  —  5 sec2  Odd  . 


f 

dx 

-  f  > _ 

5  sec2  0  dO 

1  V25+x2 

J  V25  +  25tan20 

Using  our  substitution 


\/ 25 ( 1  +  tan2  0) 


•5  sec ~0d6  Factor  out  25 


1 


5x/  sec2 6 
sec  0  d6 


■  5  sec2  0  dO 


—  In  |  sec  0  +  tan  0  \  +  C 
Since  tan  0  —x/5,  we  draw  a  triangle: 


Using  tan2  0  +  1  =  sec2  0 
Simplifying 

By  f  sec  0  dx  —  In  |  sec  0  +  tan  0 1  +  C 


x 


Then 


Therefore,  the  integral  is 


sec0  = 


1  V25+X2 


V25+x2 


cos  0 


dx  —  In 


V25  +  .r2  (  x 
5  +  5 


+  C 


X 
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In  the  next  example,  we  will  use  the  technique  of  completing  the  square  in  order  to  rewrite  the  inte¬ 
grand. 


Example  7.27:  Completing  the  Square 


Evaluate 


x 


J  V  3  — 2x  —  x2 


dx. 


Solution  First,  complete  the  square  to  write 

3  —  2 x  —  x2  —  4  —  (x  +  l)2 

Now,  we  may  let  u  —  x  +  1  so  that  du  —  dx  (note  that x  —  u  —  1)  to  get: 

^  dx—  [  ,  du 


'  \/4-(x+  l)2 

Let  u  —  2sin0  giving  du  =  2 cos  0d0\ 


\J 4  —  u2 


du—  .2 cos QdQ  —  I  (2sin0  —  l)d6 


V4-u2  J  2  cos  6 


Integrating  and  using  a  triangle  we  get: 

f  x 


V  3  —  2x  —  x2 


—  — 2cos  0  —  6+C 

—  —  ’sj  4  —  u2  —  sin- 1  (^j  +C 
=  —\/3  —  2x  —  x2  —  sin-1  (X  +  -  )  +C 


Note  that  in  this  problem  we  could  have  skipped  the  n-substitution  if  instead  we  let  x+  1  =  2sin  6.  (For 

x  H-  1 

the  triangle  we  would  then  use  sin  6  —  — - — .)  X 


Exercises  for  7.3 


Exercise  7.3.1 


c2  —  1  dx 


Exercise  7.3.2  /  \J9  +  4x2  dx 


Exercise  7.3.3  j  x\/ 1  —x2dx 
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Exercise  7.3.4 

Exercise  7.3.5 

Exercise  7.3.6 

Exercise  7.3.7 

Exercise  7.3.8 

Exercise  7.3.9 

Exercise  7.3.10 


x2  \J  1  —  x2  dx 

1 


y/l  +x2 


dx 


x2  +  2 xdx 


1 


JC2(1  +x 2) 
Jl 


dx 


\J  \  —  X2 

fx 


dx 


\/\—x 

v-3 


dx 


a/  4x2  —  1 


dx 


7.4  Integration  by  Parts 


We  have  already  seen  that  recognizing  the  product  rule  can  be  useful,  when  we  noticed  that 

j  sec3  u  +  sec  u  tan2  udu  —  sec  u  tan  u. 

As  with  substitution,  we  do  not  have  to  rely  on  insight  or  cleverness  to  discover  such  antiderivatives;  there 
is  a  technique  that  will  often  help  to  uncover  the  product  rule. 

Start  with  the  product  rule: 

=  f'(x)g(x)  +  f(x)g'(x). 

We  can  rewrite  this  as 

f{x)g(x)  =  f  f'{x)g(x)  dx  +  I  f{x)g{x)dx, 

and  then 

I  f(x)g'(x)  dx  =  f(x)g(x)  -  I  f'(x)g(x)  dx. 

This  may  not  seem  particularly  useful  at  first  glance,  but  it  turns  out  that  in  many  cases  we  have  an  integral 
of  the  form 

I  f(x)g'(x)dx 

but  that 

J  f{x)g{x)dx 
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is  easier.  This  technique  for  turning  one  integral  into  another  is  called  integration  by  parts,  and  is  usually 
written  in  more  compact  form.  If  we  let  u  =  f(x)  and  v  =  g(x)  then  du  —  f'(x)  dx  and  dv  =  g'(x)  dx  and 

J  udv  —  uv—  j  vdu. 

To  use  this  technique  we  need  to  identify  likely  candidates  for  u  =  /(x)  and  dv  =  g'(x)  dx. 


Example  7.28:  Product  of  a  Linear  Function  and  Logarithm 


Evaluate  /  xlnxdx. 


Solution.  Let  u  =  lnx  so  du  —  1  /x dx.  Then  we  must  let  dv  —  x dx  so  v  =  x  / 2  and 

x2  lnx 


xlnxdx  = 


'  x2  1  a2  I  n  x 

i~xix=^r 


x  x2  lnx 
-  dx  —  — - — 
2  2 


*2  ^ 
4+C 


Example  7.29:  Product  of  a  Linear  Function  and  Trigonometric  Function 


Solution.  Let  u  —  x  so  du  —  dx.  Then  we  must  let  dv  —  sin xdx  so  v  =  —  cos  a  and 


xslnxdx  —  —  .rcosx—  /  —  cosxdx  —  —  ,rcosx+  /  cosxdx  —  —  xcosx+  sin.r  +  C. 


Solution.  Of  course  we  already  know  the  answer  to  this,  but  we  needed  to  be  clever  to  discover  it.  Here 

/j 

we’ll  use  the  new  technique  to  discover  the  antiderivative.  Let  u  —  secx  and  dv  —  sec  xdx.  Then  du  = 
sec  a  tan  a  and  v  =  tan  a  and 


sec  xdx  —  secxtanA  —  J  tan  xsecxdx 

—  secxtanx—  ^(sec2x—  1)  secxdx 

—  secxtanx—  /  sec 3xdx+  /  secxdx. 
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At  first  this  looks  useless — we’re  right  back  to  j  sec3x dx.  But  looking  more  closely: 


J  sec 3  xdx 
sec3  xdx  +  j  sec3  xdx 
2  /  see3  xdx 


—  seextanx—  J  sec3  xdx  +  j  sec  xdx 

—  secxtanx+  j  sec  xdx 

—  seextanx  +  /  sec  xdx 


sec3  xdx  — 


seextanx  1  f 

- - - h  -  /  secxdx 

2  2 


seextanx  In  I  secx  +  tanxl 
=  — [+c 


Example  7.31:  Product  of  a  Polynomial  and  Trigonometric  Function 


Evaluate 


sin  xdx. 


Solution.  Let  u  —  x2,  dv  —  sin  xdx;  then  du  =  2  xdx  and  v  =  —  cosx.  Now 

j  x2sinxdx  =  —  x2cosx+  j  2xcos xdx. 

This  is  better  than  the  original  integral,  but  we  need  to  do  integration  by  parts  again.  Let  u  =  2x,  dv  — 
cos  xdx,  then  du  —  2  and  v  =  sinx,  and 

j  x2  smxdx  =  — x2cosx+  j  2xcosxdx 

—  —x2  cosx  +  2x  sinx  —  J  2  sinxdx 
=  — x2cosx  +  2xsinx  +  2cos  x  +  C. 

X 

Such  repeated  use  of  integration  by  parts  is  fairly  common,  but  it  can  be  a  bit  tedious  to  accomplish, 
and  it  is  easy  to  make  errors,  especially  sign  errors  involving  the  subtraction  in  the  formula.  There  is  a 
nice  tabular  method  to  accomplish  the  calculation  that  minimizes  the  chance  for  error  and  speeds  up  the 
whole  process.  We  illustrate  with  the  previous  example.  Here  is  the  table: 


sign 

u 

dv 

+ 

x2 

sinx 

- 

2x 

—  cosx 

+ 

2 

—  sinx 

- 

0 

cosx 

To  form  this  table,  we  start  with  u  at  the  top  of  the  second  column  and  repeatedly  compute  the  derivative; 
starting  with  dv  at  the  top  of  the  third  column,  we  repeatedly  compute  the  antiderivative.  In  the  first 
column,  we  place  a  ”  in  every  second  row.  To  form  the  second  table  we  combine  the  first  and  second 
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columns  by  ignoring  the  boundary;  if  you  do  this  by  hand,  you  may  simply  start  with  two  columns  and 
add  a  ”  to  every  second  row. 

Alternatively,  we  can  use  the  following  table: 


u 

dv 

xz 

sinx 

— 2x 

—  cosx 

2 

—  sinx 

0 

cosx 

To  compute  with  this  second  table  we  begin  at  the  top.  Multiply  the  first  entry  in  column  u  by  the  second 
entry  in  column  dv  to  get  —  x“cosx,  and  add  this  to  the  integral  of  the  product  of  the  second  entry  in 
column  u  and  second  entry  in  column  dv.  This  gives: 


? 

— X  COSX  + 


2xcos  xdx. 


or  exactly  the  result  of  the  first  application  of  integration  by  parts.  Since  this  integral  is  not  yet  easy,  we 
return  to  the  table.  Now  we  multiply  twice  on  the  diagonal,  (x2)(— cosx)  and  (— 2x)(— sinx)  and  then 
once  straight  across,  (2)  (—  sinx),  and  combine  these  as 


— x  cosx +  2x  sinx  — 


2  sin  xdx. 


giving  the  same  result  as  the  second  application  of  integration  by  parts.  While  this  integral  is  easy,  we 
may  return  yet  once  more  to  the  table.  Now  multiply  three  times  on  the  diagonal  to  get  (x2)(  — cosx), 
(— 2x)(—  sinx),  and  (2) (cosx),  and  once  straight  across,  (0)(cosx).  We  combine  these  as  before  to  get 


— x2cosx  +  2xsinx  +  2cosx  +  /  Odx  —  —  x"cosx  +  2xsinx  +  2cosx  +  C. 


Typically  we  would  fill  in  the  table  one  line  at  a  time,  until  the  “straight  across”  multiplication  gives  an 
easy  integral.  If  we  can  see  that  the  u  column  will  eventually  become  zero,  we  can  instead  fill  in  the  whole 
table;  computing  the  products  as  indicated  will  then  give  the  entire  integral,  including  the  “+C”,  as  above. 


Exercises  for  7.4 


Find  the  antiderivatives. 
Exercise  7.4.1  /  xcos xdx 


Exercise  7.4.2  /  x^cos xdx 


Exercise  7.4.3 


xex  dx 
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Exercise  7.4.4 

Exercise  7.4.5 

Exercise  7.4.6 

Exercise  7.4.7 

Exercise  7.4.8 

Exercise  7.4.9 

Exercise  7.4.10 

Exercise  7.4.11 

Exercise  7.4.12 

Exercise  7.4.13 

Exercise  7.4.14 


xex  dx 


sin  xdx 


In  xdx 


x  arc  tan  xdx 


xi  sin  xdx 


x3  cos  xdx 


x  sin- xdx 


xsinxcosxdx 


x)  dx 


sin(v^)dx 


sec2  x  esc2  xdx 


7.5  Rational  Functions 


A  rational  function  is  a  fraction  with  polynomials  in  the  numerator  and  denominator.  For  example, 

x3  1  x2  + 1 

x2+x  — 6’  (x  — 3)2’  x2  —  1’ 

are  all  rational  functions  of  x.  There  is  a  general  technique  called  “partial  fractions”  that,  in  principle, 
allows  us  to  integrate  any  rational  function.  The  algebraic  steps  in  the  technique  are  rather  cumbersome 
if  the  polynomial  in  the  denominator  has  degree  more  than  2,  and  the  technique  requires  that  we  factor 
the  denominator,  something  that  is  not  always  possible.  However,  in  practice  one  does  not  often  run 
across  rational  functions  with  high  degree  polynomials  in  the  denominator  for  which  one  has  to  find  the 
antiderivative  function.  So  we  shall  explain  how  to  find  the  antiderivative  of  a  rational  function  only  when 
the  denominator  is  a  quadratic  polynomial  ax  +bx  +  c. 
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We  should  mention  a  special  type  of  rational  function  that  we  already  know  how  to  integrate:  If  the 
denominator  has  the  form  (ax  +  b)n,  the  substitution  u  —  ax  +  b  will  always  work.  The  denominator 
becomes  u'\  and  each  x  in  the  numerator  is  replaced  by  (w  —  b)/a,  and  dx  —  du/a.  While  it  may  be  tedious 
to  complete  the  integration  if  the  numerator  has  high  degree,  it  is  merely  a  matter  of  algebra. 


Solution  Using  the  substitution  u  —  3  —  2x  we  get 

3  ,  „  (u-3\3 


(3  —  2x)5 


dx  —  — - 
-2 

1 


{*=zY  ,  1  f  u3  —  9u2  +  21 u  —  27  7 

—  c — du—  —  /  - ? - du 


uJ 


16, 


uJ 


—  ——  I  u  —9 u  +27 u  —  21 u  du 
16  J 

1  ( m”1  9 u~2  27 u~3  27m-4', 

+ - - - —  +C 


16  V  -1  -2 


-3 


-4 


1  /(3-2x)-1  9(3  -2x)~2  27(3  —  2x)~3  27(3 -2.r )' 

16  {  ^2  +  ^3  ^4 


1 


+ 


+ 


27 


16(3 -2x)  32(3  -2x)2  16(3 -2x)3  64(3  -2x) 


+  C 


+  C 


We  now  proceed  to  the  case  in  which  the  denominator  is  a  quadratic  polynomial.  We  can  always 
factor  out  the  coefficient  of  x  and  put  it  outside  the  integral,  so  we  can  assume  that  the  denominator 
has  the  form  x  +bx  +  c.  There  are  three  possible  cases,  depending  on  how  the  quadratic  factors:  either 
x2  +  bx  +  c  =  (a  —  r)(x  —  s),  x2  +bx  +  c  —  (x  —  r)2,  or  it  doesn’t  factor.  We  can  use  the  quadratic  formula 
to  decide  which  of  these  we  have,  and  to  factor  the  quadratic  if  it  is  possible. 


Example  7.33:  Factoring  a  Quadratic 


'j 

Determine  whether  x  +x+  1  factors,  and  factor  it  if  possible. 


Solution.  The  quadratic  formula  tells  us  that  x2  +  x  +  1  =  0  when 

-1±VT2I4 

2  ' 

Since  there  is  no  square  root  of  —3,  this  quadratic  does  not  factor.  X 


Example  7.34:  Factoring  a  Quadratic  with  Real  Roots 


'j 

Determine  whether  x  —  x  —  1  factors,  and  factor  it  if  possible. 
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Solution  The  quadratic  formula  tells  us  that  x  —  x  —  1  =  0  when 

1  ±  vT  +  4  1  ±  \/5 


x  = 


Therefore 


■  x  —  1  =  x 


l  +  y/5" 


If  x2  +  bx  +  c  —  (x  —  r)2  then  we  have  the  special  case  we  have  already  seen,  that  can  be  handled  with 
a  substitution.  The  other  two  cases  require  different  approaches. 

If  x2  +  bx  +  c  —  (x  —  r )  (x  —  5) ,  we  have  an  integral  of  the  form 


p(x) 


dx 


J  (x-r)(x-s) 

where  p(x)  is  a  polynomial.  The  first  step  is  to  make  sure  that  p(x)  has  degree  less  than  2. 


Solution.  To  do  this  we  use  long  division  of  polynomials  to  discover  that 

x3  x3  7x  —  6  lx  — 6 

(x  — 2)(x  +  3)  x2+x  — 6  "*”x2+x  — 6  (x  — 2)(x  +  3) 

See  http://en.wikipedia.org/wiki/Polynomial_long_division  for  a  review  on  long  division. 
Then 

f  x3  f  ,  ,  f  lx- 6 

/  7 - X77 - XV  dx—  x  —  1  dx+  /  7 - —7 - —  dx. 

J  (x  — 2)(x  +  3)  J  J  (x  — 2)(x  +  3) 

The  first  integral  is  easy,  so  only  the  second  requires  some  work.  X 

Now  consider  the  following  simple  algebra  of  fractions: 

A  B  A(x  —  5)  +B(x  —  r)  (. A  +  B)x—As  —  Br 

x-r  x-s  (x-r)(x-s)  (x-r)(x-s) 

That  is,  adding  two  fractions  with  constant  numerator  and  denominators  (x  —  r)  and  (x  —  s )  produces  a 
fraction  with  denominator  (x  —  r)  (x  —  5)  and  a  polynomial  of  degree  less  than  2  for  the  numerator.  We 
want  to  reverse  this  process:  Starting  with  a  single  fraction,  we  want  to  write  it  as  a  sum  of  two  simpler 
fractions.  An  example  should  make  it  clear  how  to  proceed. 
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Solution.  We  start  by  writing 


lx  — 6 

(x  — 2)(x  +  3) 


as  the  sum  of  two  fractions.  We  want  to  end  up  with 


lx  — 6  A  B 

{x—  2)  (x  +  3)  x—  2~^x  +  3 

If  we  go  ahead  and  add  the  fractions  on  the  right  hand  side,  we  seek  a  common  denominator,  and  get: 

lx  — 6  _{A  +  B)x  +  3A-2B 

(x  —  2)  (x  +  3)  (x  —  2)  (x  +  3) 


So  all  we  need  to  do  is  find  A  and  B  so  that  lx  —  6  —  (A  +  B)x  +  3A  —  2B,  which  is  to  say,  we  need 
7  =  A  +  B  and  —6  =  3A  —  2 B.  This  is  a  problem  you’ve  seen  before:  Solve  a  system  of  two  equations 
in  two  unknowns.  There  are  many  ways  to  proceed;  here’s  one:  If  7  —  A  +  B  then  B  —  1  —  A  and  so 
—6  —  3A  —  2B  —  3A  —  2{1—A)  —  3A  —  14  +  2A  =  5A  —  14.  This  is  easy  to  solve  for  A:  A  =  8/5,  and  then 
B  —  l—A  —  l  —  S/5  —  21/5.  Thus 


lx  — 6  ,  f  8  1  27  1  8,  27,  .  ^ 

dx  —  I  - - -  +  — - -  dx  —  -  In  x  —  2  +  —  In  x  +  3  +C. 


J  (x  — 2)(x  +  3) 

The  answer  to  the  original  problem  is  now 

r  v-3 


5x  — 2  5  x  +  3 


(x  2)  (x  T  3) 


dx  —  /  x  —  Idx  + 


lx  —  6 


(x  —  2)  (x  +  3) 
21 
~5 


dx 


a  8  27 

— —  x~\~  —  In  lx  —  21  H — —  In  lx  “h  3 1  C. 


* 

Now  suppose  that  x  +bx  +  c  doesn’t  factor.  Again  we  can  use  long  division  to  ensure  that  the  numer¬ 
ator  has  degree  less  than  2,  then  we  complete  the  square. 


Solution  The  quadratic  denominator  does  not  factor.  We  could  complete  the  square  and  use  a  trigono¬ 
metric  substitution,  but  it  is  simpler  to  rearrange  the  integrand: 


x  -|-  1 


x2  +  4x  +  8 


dx  — 


x-t-2 


x2  +  4x  +  8 


dx 


1 


x2  +  4x  +  8 


dx. 


The  first  integral  is  an  easy  substitution  problem,  using  u 


x  -\~2 


x2  +  4x  +  8 


1  f  du 
dx—  -  /  — 


x2  +4x+  8: 

1  ,  7 

-  In  |x2  +  4x  +  8 
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For  the  second  integral  we  complete  the  square: 


x2  +  4x  +  8  =  (x  +  2)2  +  4  =  4 


x  +  2 


+ 1  > 


making  the  integral 


T  T  •  X  +  2 

Using  u  —  ^  we  §et 


1  r  1 


4J  (+)  +i 


dx. 


If  1  J  1  f  2  J  1  /x  +  2 

4.1  +  i  4  J  w2  +  1  2  V  2 


The  final  answer  is  now 


X+l  1  i  2  oi  1  /x+2 

- - - -  dx  —  -  In  x  +  4x  +  8 - arctan  -  +  C. 

x2  +  4x  +  8  2  1  1  2  l  2 


Exercises  for  7.5 


Exercise  7.5.1 


4  —  x2 


dx 


Exercise  7.5.2 


4  —  x2 


dx 


Exercise  7.5.3 


1 


x2  +  10x  +  25 


dx 


Exercise  7.5.4 


4  —  x2 


dx 


Exercise  7.5.5 


4  +  x‘ 


dx 


Exercise  7.5.6 


1 


x2  +  10x  +  29 


dx 


Exercise  7.5.7 


4  +  x2 


dx 
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Exercise  7.5.8  f  ^ - dx 

J  x2  +  10x  +  2l 


Exercise  7.5.9  [  — ^ - -  dx 

J  2x2  —  x  —  3 


Exercise  7.5.10  /  ~  —  dx 

J  x2  +  3.v 
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We  have  now  seen  some  of  the  most  generally  useful  methods  for  discovering  antiderivatives,  and  there 
are  others.  Unfortunately,  some  functions  have  no  simple  antiderivatives.  In  such  cases,  if  the  value  of  a 
definite  integral  is  needed  it  will  have  to  be  approximated.  We  will  see  two  methods  that  work  reasonably 
well  and  yet  are  fairly  simple;  in  some  cases  more  sophisticated  techniques  will  be  needed. 

Of  course,  we  already  know  one  way  to  approximate  an  integral:  If  we  think  of  the  integral  as  comput¬ 
ing  an  area,  we  can  add  up  the  areas  of  some  rectangles.  While  this  is  quite  simple,  it  is  usually  the  case 
that  a  large  number  of  rectangles  is  needed  to  get  acceptable  accuracy.  A  similar  approach  is  much  better. 
We  approximate  the  area  under  a  curve  over  a  small  interval  as  the  area  of  a  trapezoid.  In  figure  7.1  we 
see  an  area  under  a  curve  approximated  by  rectangles  and  by  trapezoids;  it  is  apparent  that  the  trapezoids 
give  a  substantially  better  approximation  on  each  subinterval. 


Figure  7.1:  Approximating  an  area  with  rectangles  and  with  trapezoids. 


As  with  rectangles,  we  divide  the  interval  into  n  equal  subintervals  of  length  At.  A  typical  trapezoid  is 
pictured  in  figure  7.2;  it  has  area 


/(*i0 +/(*«+ 1) 
2 


At. 


If  we  add  up  the  areas  of  all  trapezoids  we  get 

/(^o)+/(u)Ax  |  f(xi)+f(x2) ^  |  |  f(xn-i)+f(xn)^ 

—  H - h/(x„_i)  +  ^2  ^  A-X- 

For  a  modest  number  of  subintervals  this  is  not  too  difficult  to  do  with  a  calculator;  a  computer  can  easily 
do  many  subintervals. 
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Figure  7.2:  A  single  trapezoid. 


In  practice,  an  approximation  is  useful  only  if  we  know  how  accurate  it  is;  for  example,  we  might  need 
a  particular  value  accurate  to  three  decimal  places.  When  we  compute  a  particular  approximation  to  an 
integral,  the  error  is  the  difference  between  the  approximation  and  the  true  value  of  the  integral.  For  any 
approximation  technique,  we  need  an  error  bound,  a  value  that  is  guaranteed  to  be  larger  than  the  actual 
error.  If  A  is  an  approximation  and  E  is  the  associated  error  bound,  then  we  know  that  the  true  value  of  the 
integral  is  between  A  —  E  and  A  +  E.  In  the  case  of  our  approximation  of  the  integral,  we  want  E  —  E  (At) 
to  be  a  function  of  At  that  gets  small  rapidly  as  At  gets  small.  Fortunately,  for  many  functions,  there  is 
such  an  error  bound  associated  with  the  trapezoid  approximation. 


Theorem  7.38:  Error  for  Trapezoid  Approximation 


Suppose  f  has  a  second  derivative  f"  everywhere  on  the  interval  [a,  b] ,  and  \  f"{x)  \<M  for  all  x  in 
the  interval.  With  Ax  =  (b  —  a)/n,  an  error  bound  for  the  trapezoid  approximation  is 


E{  At) 


b  —  a 


12 


M(  AFT 


(b  —  a)3 
12  n2 


M. 


Let’s  see  how  we  can  use  this. 


Example  7.39:  Approximate  an  Integral  With  Trapezoids 

Approximate  j 

ft  2 

'  e  x  dx  to  two  decimal  places. 

0 

Solution.  The  second  derivative  of  /  =  e~x~  is  ( 4x 2  —  2)e  A\  and  it  is  not  hard  to  see  that  on  [0, 1]  \f"(x)\ 
has  a  maximum  value  of  2,  thus  we  begin  by  estimating  the  number  of  subintervals  we  are  likely  to  need. 
To  get  two  decimal  places  of  accuracy,  we  will  certainly  need  E{  At)  <  0.005  or 


5.77 


AA 

< 

0.005 

7(200) 

< 

n2 

Jlf 

< 

n 

V  3 
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With  n  —  6,  the  error  bound  is  thus  1/63  <  0.0047.  We  compute  the  trapezoid  approximation  for  six 
intervals: 

(yy  +  /(l/6)  +  /(2/6)  +  •  •  •  +  /( 5/6)  +  ^  «  0.74512. 

So  the  true  value  of  the  integral  is  between  0.74512  —  0.0047  =  0.74042  and  0.74512  +  0.0047  =  0.74982. 
Unfortunately,  the  first  rounds  to  0.74  and  the  second  rounds  to  0.75,  so  we  can’t  be  sure  of  the  correct 
value  in  the  second  decimal  place;  we  need  to  pick  a  larger  n.  As  it  turns  out,  we  need  to  go  to  n  = 
12  to  get  two  bounds  that  both  round  to  the  same  value,  which  turns  out  to  be  0.75.  For  comparison, 
using  12  rectangles  to  approximate  the  area  gives  0.7727,  which  is  considerably  less  accurate  than  the 
approximation  using  six  trapezoids. 

In  practice  it  generally  pays  to  start  by  requiring  better  than  the  maximum  possible  error;  for  example, 
we  might  have  initially  required  E  (Ar)  <  0.001,  or 


12.91 


i(2)-l  <  0.001 

7(1000)  <  n2 
6 


Had  we  immediately  tried  n  —  13  this  would  have  given  us  the  desired  answer. 


X 


The  trapezoid  approximation  works  well,  especially  compared  to  rectangles,  because  the  tops  of  the 
trapezoids  form  a  reasonably  good  approximation  to  the  curve  when  At  is  fairly  small.  We  can  extend  this 
idea:  what  if  we  try  to  approximate  the  curve  more  closely  by  using  something  other  than  a  straight  line? 
The  obvious  candidate  is  a  parabola:  If  we  can  approximate  a  short  piece  of  the  curve  with  a  parabola  with 
equation  y  —  ax 2  +  bx  +  c,  we  can  easily  compute  the  area  under  the  parabola. 

There  are  an  infinite  number  of  parabolas  through  any  two  given  points,  but  only  one  through  three 
given  points.  If  we  find  a  parabola  through  three  consecutive  points  (x;-, /(+■)),  (x!+i,/(x,+i)),  (xj+2,/(x,-+2)) 
on  the  curve,  it  should  be  quite  close  to  the  curve  over  the  whole  interval  [x,-,x,-+ 2],  as  in  Figure  7.3.  If 
we  divide  the  interval  [a,  b\  into  an  even  number  of  subintervals,  we  can  then  approximate  the  curve  by 
a  sequence  of  parabolas,  each  covering  two  of  the  subintervals.  For  this  to  be  practical,  we  would  like  a 
simple  formula  for  the  area  under  one  parabola,  namely,  the  parabola  through  (xj,/(x;)),  (x,+  i  ,/(xi+i )), 
and  (xJ+2,/(x;+2)).  That  is,  we  should  attempt  to  write  down  the  parabola  y  —  ax2  +  bx  +  c  through  these 
points  and  then  integrate  it,  and  hope  that  the  result  is  fairly  simple.  Although  the  algebra  involved  is 
messy,  this  turns  out  to  be  possible.  The  algebra  is  well  within  the  capability  of  a  good  computer  algebra 
system  like  Sage,  so  we  will  present  the  result  without  all  of  the  algebra. 

To  find  the  parabola,  we  solve  these  three  equations  for  a,  b,  and  c: 

/(x,-)  —  a(x,-+ 1  —  Ar)2  +  &(x,-+i  —  Ar)  +  c 
/(x/+ 1)  =  a(xi+i)2  +  b(xi+l)  +  c 
/(+■+ 2)  =  a(Xf+i  +  Ar)2  +  b(xi+ 1  +  Ar)  +  c 


Not  surprisingly,  the  solutions  turn  out  to  be  quite  messy.  Nevertheless,  Sage  can  easily  compute  and 
simplify  the  integral  to  get 


rXi- |_i+Aa 

Ax 


ax~  +  bx  +  c  dx 


y  (/(+')  +4/(++i)  +f(xi+2)). 
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Now  the  sum  of  the  areas  under  all  parabolas  is 

\x. 

y  (/(*o)  +4/(*i)  +f(x2)  +  f(x2)  +  4/(x3)  +  /(x4)  H - b/(x„_2)  +  4/(x„_i)  +  /(*„))  = 

Ax 

y  (/Oo)  +4/(xi)  +  2/(x2)  +  4/(x3)  +  2/(x4)  H - f2/(x„_2)  +4/(xn_i)  +/(x„)). 

This  is  just  slightly  more  complicated  than  the  formula  for  trapezoids;  we  need  to  remember  the  alter¬ 
nating  2  and  4  coefficients,  and  that  the  interval  must  be  divided  into  an  even  number  of  subintervals.  This 
approximation  technique  is  referred  to  as  Simpson’s  Rule. 


%i  -9'+ 1  -'9-1-2 


Figure  7.3:  A  parabola  (dashed)  approximating  a  curve  (solid). 


As  with  the  trapezoid  method,  this  is  useful  only  with  an  error  bound: 


Theorem  7.40:  Error  for  Simpson’s  Approximation 


Suppose  f  has  a  fourth  derivative  ft4>  everywhere  on  the  interval  [a,  b] ,  and  \fiA]  (x)  |  <M  for  all  x 
in  the  interval.  With  A x  —  (b  —  a)  / n,  an  error  bound  for  Simpson’s  approximation  is 


E(  Ax) 


180 


-M{AxY 


(b  —  a)5 
180  n4 


M. 


Example  7.41:  Approximate  an  Integral  With  Parabolas 

Let  us  again  approximate  j 

ft  7 

'  e  x  dx  to  two  decimal  places. 

0 

Solution.  The  fourth  derivative  of  /  =  e  x  is  (16x4  —  48x2  +  1 2)e  ~  A  ;  on  [0, 1]  this  is  at  most  12  in  absolute 
value.  We  begin  by  estimating  the  number  of  subintervals  we  are  likely  to  need.  To  get  two  decimal  places 
of  accuracy,  we  will  certainly  need  E{  Ax)  <  0.005,  but  taking  a  cue  from  our  earlier  example,  let’s  require 
E (Ax)  <0.001: 


1 

180 


(12)4 

200 


200 


<  0.001 


<  n 


4 


2.86  ph 


3 


<  n 
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So  we  try  n  —  4,  since  we  need  an  even  number  of  subintervals.  Then  the  error  bound  is  12/180/44  < 
0.0003  and  the  approximation  is 

(/(0)  +4/(l/4)  +  2/(l/2)  +4/(3/4)  +/(1))^  «  0.746855. 

So  the  true  value  of  the  integral  is  between  0.746855  —  0.0003  =  0.746555  and  0.746855  +  0.0003  = 
0.7471555,  both  of  which  round  to  0.75.  X 


Exercises  for  7.6 


In  the  following  problems,  compute  the  trapezoid  and  Simpson  approximations  using  4  subintervals,  and 
compute  the  error  bound  for  each.  (Finding  the  maximum  values  of  the  second  and  fourth  derivatives  can 
be  challenging  for  some  of  these;  you  may  use  a  graphing  calculator  or  computer  software  to  estimate  the 
maximum  values.) 

Exercise  7.6.1  J  xdx 


Exercise  7.6.2  f  x2  dx 

Jo 


Exercise  7.6.3  /  jc  dx 


f3  1 

Exercise  7.6.4  /  -dx 

J 1  x 


Exercise  7.6.5 


1 


dx 


1  +x2 

Exercise  7.6.6  I  x\/l  +xdx 
Jo 


Exercise  7.6.7 


'l  1  +  x 


dx 


Exercise  7.6.8  f  \/x3  + 1  dx 
Jo 

Exercise  7.6.9  [  \/x^  + 1  dx 
Jo 

/*4 

Exercise  7.6.10  j  y/l  +  1  /xdx 
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Exercise  7.6.11  Using  Simpson ’s  rule  on  a  parabola  f(x),  even  with  just  two  subintervals,  gives  the  exact 
value  of  the  integral,  because  the  parabolas  used  to  approximate  f  will  be  f  itself.  Remarkably,  Simpson’s 
rule  also  compu  tes  the  integral  of  a  cubic  function  f(x)  —  ax 3  +  bx2  +  cx  +  d  exactly.  Show  this  is  true  by 
showing  that 

J  /(*)  dx  =  X23 .  *°  (f(xo)  +  4/((x0  +x2)/2)  +  f(x2)). 

This  does  requ  ire  a  bit  of  messy  algebra,  so  you  may  prefer  to  use  Sage. 


7.7  Improper  Integrals 


Recall  that  the  Fundamental  Theorem  of  Calculus  says  that  if  /  is  a  continuous  function  on  the  closed 
interval  [a,b\,  then 

rb 


f(x)  dx  —  F{x) 


=  F(b)  —F(a), 


where  F  is  any  antiderivative  of  /. 


Both  the  continuity  condition  and  closed  interval  must  hold  to  use  the  Fundamental  Theorem  of 

(b 

Calculus,  and  in  this  case,  /  fix)  dx  represents  the  net  area  under  f{x)  from  a  to  b: 

Ja 


f  0) 


X 


We  begin  with  an  example  where  blindly  applying  the  Fundamental  Theorem  of  Calculus  can  give  an 
incorrect  result. 


Example  7.42:  Using  FTC 

Explain  why  j 

f1  1 

dx  is  not  equal  to  —2. 

-1  xL 

Solution  Here  is  how  one  might  proceed: 


1  \dx  = 
-1  xz 


x  2  dx  —  —x  1 


=  -2 


However,  the  above  answer  is  WRONG!  Since  /(x)  =  1/x2  is  not  continuous  on  [—1,  1],  we  cannot 
directly  apply  the  Fundamental  Theorem  of  Calculus.  Intuitively,  we  can  see  why  —2  is  not  the  correct 
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answer  by  looking  at  the  graph  of  f(x)  =  1  /x2  on  [—  1,  1] .  The  shaded  area  appears  to  grow  without  bound 
as  seen  in  the  figure  below. 

y 


X 

Formalizing  this  example  leads  to  the  concept  of  an  improper  integral.  There  are  two  ways  to  extend 
the  Fundamental  Theorem  of  Calculus.  One  is  to  use  an  infinite  interval,  i.e.,  (— °°,b\  or  (—00,00). 

The  second  is  to  allow  the  interval  [a,b\  to  contain  an  infinite  discontinuity  of  fix).  In  either  case, 
the  integral  is  called  an  improper  integral.  One  of  the  most  important  applications  of  this  concept  is 
probability  distributions. 

To  compute  improper  integrals,  we  use  the  concept  of  limits  along  with  the  Fundamental  Theorem  of 
Calculus. 


If  the  limit  exists  and  is  a  finite  number,  we  say  the  improper  integral  converges.  Otherwise,  we  say 
the  improper  integral  diverges. 

To  get  an  intuitive  (though  not  completely  correct)  interpretation  of  improper  integrals,  we  attempt  to 

r-oo 

analyze  /  f(x)  dx  graphically.  Here  assume  fix)  is  continuous  on  [a,°°): 

J  a 
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We  let  R  be  a  fixed  number  in 
integral: 


[a,°°).  Then  by  taking  the  limit  as  R  approaches  °°,  we  get  the  improper 


f(x)dx:=  lim 
R — . 


f(x)  dx. 


We  can  then  apply  the  Fundamental  Theorem  of  Calculus  to  the  last  integral  as  f(x)  is  continuous  on  the 
closed  interval  [a,R], 

We  next  define  the  improper  integral  for  the  interval  (— °°,  °°). 


r  i 

Definition  7.44:  Definitions  for  Improper  Integrals 

If  both  j 

ra 

'  f(x)  dx  and  j 

—  oo  J 

poo 

'  fix)  dx  are  convergent,  then  the  improper  integral  of  f  over  (— oo,oo)  is: 

a 

poo  pa  poo 

/  f(x)dx:=  /  f(x)  dx  +  /  f(x)dx 

J —oo  J —oo  Ja 

The  above  definition  requires  both  of  the  integrals 

/  f(x)  dx  and 


f(x)  dx 


/oo  pa  poo 

f(x)  dx  to  also  be  convergent.  If  either  of  /  f(x)  dx  or  /  f(x)  dx  is  divergent, 

-oo  J—  oo  Ja 

poo 

then  so  is  /  f(x)  dx. 


Example  7.45:  Improper  Integral 

Determine  whether  j 

,°°  i 

'  -  dx  is  convergent  or  divergent. 

l  x 

Solution,  Using  the  definition  for  improper  integrals  we  write  this  as: 


1  fR  1 

/  -dx  —  lim  /  -dx  —  lim  In  be 

'\  X  X  R^°° 


=  lim  ln|/?|  —  In  1 1 1 

R—>  oo 


=  lim  In  \R\  —  +o° 


R—>o 


Therefore,  the  integral  is  divergent. 


Example  7.46:  Improper  Integral 

Determine  whether  j 

f°°  ? 

'  x sin(x“)  dx  is  convergent  or  divergent. 

— oo 

2  r°  , 

Solution.  We  must  compute  both  /  xsin(jr)  dx  and  /  x sin (x“ )  dx.  Note  that  we  don’t  have  to  split  the 

JO  J —oo 

integral  up  at  0,  any  finite  value  a  will  work.  First  we  compute  the  indefinite  integral.  Let  u  —  x2,  then 
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du  =  2x  dx  and  hence, 


I xsin (x2)dx  —  ^  j \m(u)du  —  —  ^cos(jr)  +C 
Using  the  definition  of  improper  integral  gives: 


[°°  2  fR 

/  xsin(x~)r/x  =  lim  /  xsin (x~)dx=  lim 

J  0  R—t° o  J  o  R — ^°° 


1  9  1  ^  1  1 

-cos(x2)  = -- lim  cos(7?2)  +  - 

2  J  0  2 1?— »°°  2 


This  limit  does  not  exist  since  cosx  oscillates  between  —1  and  +1.  In  particular,  cosx  does  not  approach 
any  particular  value  as  x  gets  larger  and  larger.  Thus,  /  xsin(x  )  dx  diverges,  and  hence,  /  xsin(x  )  dx 

JO  j  —  oo 

diverges. 

When  there  is  a  discontinuity  in  [a,b\  or  at  an  endpoint,  then  the  improper  integral  is  as  follows. 


If  the  limit  above  exists  and  is  a  finite  number,  we  say  the  improper  integral  converges.  Otherwise,  we 
say  the  improper  integral  diverges. 

When  there  is  a  discontinuity  in  the  interior  of  [a,b\,  we  use  the  following  definition. 


fb 

Again,  we  can  get  an  intuitive  sense  of  this  concept  by  analyzing  /  /(x)  dx  graphically.  Here  assume 

J  a 
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/(x)  is  continuous  on  (a,b\  but  discontinuous  at  x  —  a: 


We  let  R  be  a  fixed  number  in  (a,b).  Then  by  taking  the  limit  as  R  approaches  a  from  the  right,  we  get 
the  improper  integral: 

j-b  rb 

/  f(x)dx:=  lim  /  f(x)dx. 

J  a  R — J  R 

Now  we  can  apply  FTC  to  the  last  integral  as  /(x)  is  continuous  on  [R,b\. 


Example  7.49:  A  Divergent  Integral 

Determine  if  j 

1 

'  — y  c/x  is  convergent  or  divergent. 

-1  xz 

Solution  The  function  /(x)  —  1  /x2  has  a  discontinuity  at  x  —  0,  which  lies  in  [—1,1].  We  must  compute 


f°  1 


"!  1 


1 


-*dx  and  /  -^dx.  Let’s  start  with  /  ~^dx: 


-l  x- 


/  0  x 


)  0  x 


-1  1 


"1  1 


1 


—^rdx—  lim  /  -^rdx—  lim  — 


/  o  x~ 


r^o+Jr  x2  ’  x 


l 


1 


=  -1  +  lim  - 
R  R^  0+  R 


f 1  1  1  1 

which  diverges  to  +°°.  Therefore,  /  —^dx  is  divergent  since  one  of  /  -y  dx  and  /  —  dx  is  divergent. 

J- 1  xz  .7-1  xz  7o  xz 


Example  7.50:  Integral  of  the  Logarithm 

Determine  if  j 

rl 

'  In  xdx  is  convergent  or  divergent.  Evaluate  it  if  it  is  convergent. 

0 

Solution.  Note  that  f(x)  —  Inx  is  discontinuous  at  the  endpoint  x  =  0.  We  first  use  integration  by  parts  to 
compute  /  In xdx.  We  let  u  —  lnx  and  dv  =  dx.  Then  du  —  (1  /x)dx,  v  —  x,  giving: 


Inxr/x 


xlnx  — 


x  -  -  dx 
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Now  using  the  definition  of  improper  integral  for 


In  xdx: 


In  xdx 


lim  /  In  xdx 
R^  0+  Jr 


l 


lim  (xlnx  —  x)  =  —  1  —  lim  (RlnR)  +  lim  R 

R^  o+  r,  r->  0+ 


Note  that  lim  R  —  0.  We  next  compute  lim  (R  In  R ) .  First,  we  rewrite  the  expression  as  follows: 

R-+0+  R-^  0+ 


lim  (RlnR) 

x^0+ 


In  R 
lim  — — . 

r^o+  l/R 


Now  the  limit  is  of  the  indeterminate  type  (— °°) / (°o)  and  THopital’s  Rule  can  be  applied. 


lim  (RlnR) 

R^0+ 


InR 
lim  — — 
r-^  o+  l/R 


r  l/R 
lim  - 

R^o+  —  1  /R2 


r2 

lim - =  lim  (-R)  =  0 

fl->0+  R  R^o+ 


Thus,  lim  (RlnR)  =  0.  Thus 


lnx  dx  =  —  1, 


and  the  integral  is  convergent  to  —  1 . 

Graphically,  one  might  interpret  this  to  mean  that  the  net  area  under  lnx  on  [0, 1]  is  —  1  (the  area  in  this 
case  lies  below  the  x-axis). 


* 


Example  7.51:  Integral  of  a  Square  Root 

Determine  if  j 

f4  dx  ...... 

'  —==  is  convergent  or  divergent.  Evaluate  it  if  it  is  convergent. 
o  v4  —  x 

Solution.  Note  that  ^  is  discontinuous  at  the  endpoint  x  =  4.  We  use  a  w-substitution  to  compute 
/  We  let  u  —  4  —  x,  then  du  =  —  dx,  giving: 


dx 


du 

VJi 


y/4—X 
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=  j  —u  l/2du 

=  -2  (w)1/2+C 
=  -2V4 -x  +  C 


Now  using  the  definition  of  improper  integrals  for 


dx 

\/4  —  x 


r  _  ^ 

lim  (— 2\/4  —  x)  =  lim  —  2\/4  —  R  +  2\f4  —  4 

4~  q  4~ 


X 


Example  7.52:  Improper  Integral 

Determine  if  j 

f  ^  dx 

1  - —  is  convergent  or  divergent.  Evaluate  it  if  it  is  convergent. 

1  (x-1) 1/3 

Solution.  Note  that  /  (x)  = 


find 


dx 


(x-1) 


1/3 


is  discontinuous  at  the  endpoint  x  =  1.  We  first  use  substitution  to 


(x-1) 


1/3 


.  We  let  u  —  x  —  1 .  Then  du  —  dx,  giving 


dx 


(x  —  1 


,1/3 


1 2±  =  J u-'^du  =  3-u2'3  +C  =  3-(x- l)2'3 +C. 


r  2  dx 

Now  using  the  definition  of  improper  integral  for  /  - ^  : 

71  (x-1) 1/3 


dx 


—  lim 


dx 


—  lim  -  (x  —  1) 


2/3 


1  (x-1) 1/3  R-+1+JR  (x-1) 1/3  ^1+2 


=  *-  lim  ^(i?-l)2/3  =  ^, 
2  R-xi+  2  2 


and  the  integral  is  convergent  to  / .  Graphically,  one  might  interpret  this  to  mean  that  the  net  area  under 


(x-1) 


I73  °n  [1,2]  is  f . 
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X 

The  following  test  allows  us  to  determine  convergence/divergence  information  about  improper  inte¬ 
grals  that  are  hard  to  compute  by  comparing  them  to  easier  ones.  We  state  the  test  for  [a,°°),  but  similar 
versions  hold  for  the  other  improper  integrals. 


Informally,  (i)  says  that  if  f(x)  is  larger  than  g(x),  and  the  area  under  f(x)  is  finite  (converges),  then 
the  area  under  g(x)  must  also  be  finite  (converges).  Informally,  (ii)  says  that  if  f(x)  is  larger  than  g(x),  and 
the  area  under  g(x)  is  infinite  (diverges),  then  the  area  under  / (x)  must  also  be  infinite  (diverges). 
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Example  7.53:  Comparison  Test 


Show  that 


cos2* 


/  2  *z 


dx  converges. 


Solution  We  use  the  comparison  test  to  show  that  it  converges.  Note  that  0  <  cos2*  <  1  and  hence 

o<^<! 

*z  *z 

r°°  1 

Thus,  taking  /(*)  =  1  /*2  and  g(*)  =  cos2*/*2  we  have  /(*)  >  g(*)  >  0.  One  can  easily  see  that  /  —^dx 

J  2  xz 


converges.  Therefore, 


cos2* 


12  *z 


dx  also  converges. 


Exercises  for  Section  7.7 


r°°  l 

Exercise  7.7.1  Determine  whether  /  dx  is  convergent  or  divergent. 

J  l  xz 

r  1 

Exercise  7.7.2  Determine  whether  /  — - —  dx  is  convergent  or  divergent. 

Je  *vln* 

/»oo 

Exercise  7.7.3  Evaluate  the  improper  integral  /  e~2x  dx. 

Jo 

re  1 

Exercise  7.7.4  Determine  if  /  — — dx  is  convergent  or  divergent.  Evaluate  it  if  it  is  convergent. 

J i  *(ln*)“ 


p  00  /  \ 

Exercise  7.7.5  Show  that  J  e~x  sin2  y—J  dx  converges. 

poo  J  poo 

Exercise  7.7.6  Evaluate  /  — -  dx  and  /  — -  dx. 


\2  + 1 


Doo  x2  + 1 


Exercise  7.7.7  Determine  whether  the  following  improper  integrals  are  convergent  or  divergent.  Evaluate 
those  that  are  convergent. 


(a)  /( 


1 


0  *2  + 1 


dx 


dx 


(b)  x2  +  l 

(c)  J(“  e~r(cos*  +  sin*)  dx.  [Hint:  What  is  the  derivative  of  —  e~x  cosxl] 


300  Techniques  of  Integration 


(d)  J^2  sec2xdx 

(e)  fo  (4  —  x)2/5 


dx 


Exercise  7.7.8  Prove  that  the  integral  ff 


—  dx  is  convergent  if  p  >  1  and  divergent  if  0  <  p  <  1. 
xP 


Exercise  7.7.9  Suppose  that  p  >  0.  Find  all  values  of  p  for  which  Jq 


Exercise  7.7.10  Show  that  jf 


sin2x 
x(a/t+  1) 


dx  converges. 


—  dx  converges. 
xP 


7.8  Additional  exercises 


These  problems  require  the  techniques  of  this  chapter,  and  are  in  no  particular  order.  Some  problems  may 
be  done  in  more  than  one  way. 


Exercise  7.8.1 


Exercise  7.8.2 


I  (t  +  4)3dt 
I  t{t2-9)3/2clt 


Exercise  7.8.3  (el  +16 )te*  dt 


Exercise  7.8.4  smtcosltdt 


Exercise  7.8.5  tantsecztdt 


Exercise  7.8.6 


2/  +  1 
t2  + 1  +  3 


dt 


Exercise  7.8.7 


Exercise  7.8.8 


Exercise  7.8.9 


1 


dt 


t(t2-  4) 

(25  -t2) 3/2 
cos3t 


dt 


\/sin3t 


dt 


Exercise  7.8.10 

Exercise  7.8.11 

Exercise  7.8.12 

Exercise  7.8.13 

Exercise  7.8.14 

Exercise  7.8.15 

Exercise  7.8.16 

Exercise  7.8.17 

Exercise  7.8.18 

Exercise  7.8.19 

Exercise  7.8.20 

Exercise  7.8.21 

Exercise  7.8.22 

Exercise  7.8.23 

Exercise  7.8.24 

Exercise  7.8.25 

Exercise  7.8.26 


tsec'tdt 


J  y/e1  + 1 
I  cos4  tdt 
f  1 


dt 


t2  +  3 / 

1 


dt 


J  t2V  1  +t 2 

r  sec2/ 

,/  (1+tan/)3 


dt 


dt 


V*  \J  - hi  dt 

el  sin  tdt 


1 


J  t(9  +  4t2) 
arctan  2 1 


dt 


dt 


dt 


1+4 12 
t 

t2  +  2t-3 
sin3  /  cos4  tdt 

^  dt 

t2-6t  +  9 


1 


dt 


J  /(In/)2 
j  t(lnt)2dt 
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Exercise  7.8.27  Jt3e*dt 


r  t+ 1 

Exercise  7.8.28  /  - dt 

J  t2  +  t-  1 


8.  Applications  of  Integration 


8.1  Distance,  Velocity,  Acceleration 


We  recall  a  general  principle  that  will  later  be  applied  to  distance- velocity-acceleration  problems,  among 

fb 

other  things.  If  F(u)  is  an  anti-derivative  of  f(u),  then  /  f{u)  du  —  F(b )  —  F(a).  Suppose  that  we  want 

J a 

to  let  the  upper  limit  of  integration  vary,  i.e.,  we  replace  b  by  some  variable  x.  We  think  of  a  as  a  fixed 
starting  value  xq.  In  this  new  notation  the  last  equation  (after  adding  F (a)  to  both  sides)  becomes: 

F(x)  —  F(xq)  +  f  f(u)du. 

Jx  0 

Here  u  is  the  variable  of  integration,  called  a  “dummy  variable,”  since  it  is  not  the  variable  in  the  function 
F(x).  In  general,  it  is  not  a  good  idea  to  use  the  same  letter  as  a  variable  of  integration  and  as  a  limit  of 

integration.  That  is,  /  f(x)dx  is  bad  notation,  and  can  lead  to  errors  and  confusion. 

Jx  0 

An  important  application  of  this  principle  occurs  when  we  are  interested  in  the  position  of  an  object  at 
time  t  (say,  on  the  x-axis)  and  we  know  its  position  at  time  to.  Let  s(l  )  denote  the  position  of  the  object 
at  time  t  (its  distance  from  a  reference  point,  such  as  the  origin  on  the  x-axis).  Then  the  net  change  in 
position  between  to  and  t  is  s(t)  —  s(to).  Since  s(t)  is  an  anti-derivative  of  the  velocity  function  v(t),  we 
can  write 

s(t)  —  s(to)  +  /  v(u)du. 

Jt0 

Similarly,  since  the  velocity  is  an  anti-derivative  of  the  acceleration  function  a(t),  we  have 


v(0 


v(fo)  +  /  a(u)du. 


Example  8.1:  Constant  Force 


Suppose  an  object  is  acted  upon  by  a  constant  force  F .  Find  v(t  )  and  s(t) . 


Solution  By  Newton’s  law  F  =  ma,  so  the  acceleration  is  F/m,  where  m  is  the  mass  of  the  object.  Then 
we  first  have 


ft  F  F 

v(t)  =  v(to)+  /  —du  —  v  oH - u 

Jt0  m  in 


to 


=  Vo+-(t-t0), 

m 


using  the  usual  convention  vq  =  v(to).  Then 


s(t)  =  s(t0 )  +  J‘  (v0  +  J(m  -  to))  du  =  s0  +  (v0m  +  ^(u-  to)2)  | 
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=  so  +  Vo0  -  *0 )  +  £i(t-  to)2. 


For  instance,  when  F/tn  —  —g  is  the  constant  of  gravitational  acceleration,  then  this  is  the  falling  body 
formula  (if  we  neglect  air  resistance)  familiar  from  elementary  physics: 


S0  +  v0(t-t0)-^(t-t0)2, 

or  in  the  common  case  that  to  —  0, 

S0  +  V0t-  |/2. 


X 


Recall  that  the  integral  of  the  velocity  function  gives  the  net  distance  traveled.  If  you  want  to  know 
the  total  distance  traveled,  you  must  find  out  where  the  velocity  function  crosses  the  t- axis,  integrate 
separately  over  the  time  intervals  when  v(t)  is  positive  and  when  v(t  )  is  negative,  and  add  up  the  absolute 
values  of  the  different  integrals.  For  example,  if  an  object  is  thrown  straight  upward  at  19.6  m/sec,  its 
velocity  function  is  v(t)  —  —9.8 1  +  19.6,  using  g  =  9.8  m/sec  for  the  force  of  gravity.  This  is  a  straight  line 
which  is  positive  for  t  <  2  and  negative  for  t  >  2.  The  net  distance  traveled  in  the  first  4  seconds  is  thus 


-9.8/+19.6)d/ 


0, 


while  the  total  distance  traveled  in  the  first  4  seconds  is 


(—9.8/  +  I9.6)dt  + 


(-9.8/  +  19.6)  dt 


19.6+  I  —  19.61  =  39.2 


meters,  19.6  meters  up  and  19.6  meters  down. 


Example  8.2:  Net  and  Total  Distance 


The  acceleration  of  an  object  is  given  by  a(t )  —  cos(/r/),  and  its  velocity  at  time  t  —  0  is  1/(27 r). 
Find  both  the  net  and  the  total  distance  traveled  in  the  first  1.5  seconds. 


Solution  We  compute 


ft  1  1 

v(/)  =  v(0)  +  /  cos(7 iu)du  —  - — I —  sin(7Tw) 
Jo  2k  k 


1  A  .  , 

=  -(-  +  sm(s,)). 


The  net  distance  traveled  is  then 


/'3/2  1/1  \ 
s(3/2)  —5(0)  =  J  —  (  -  +  sin(7T/)  J  dt 


lft  1  ,  , 

—  - COS  (Kt) 

n\2n  y  1 

3  1 

4k  +  K~ 


3/2 

0 


0.340  meters. 
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To  find  the  total  distance  traveled,  we  need  to  know  when  (0.5  +  sin(/r/))  is  positive  and  when  it  is  negative. 
This  function  is  0  when  sin(7T/)  is  —0.5,  i.e.,  when  Kt  —  Ik/6,  \\k/6,  etc.  The  value  Tit  —  Ik/6,  i.e., 
t  =  7/6,  is  the  only  value  in  the  range  0  <  t  <  1.5.  Since  v(z)  >  0  for  t  <  7/6  and  v(/)  <  0  for  t  >  7/6,  the 
total  distance  traveled  is 


-  +  sin(/r/)  )  dt  +  f 
2  /  Ji 


c3/2  i 

ll/6  K 


-  +  sin  (nt) 


dt 


1 

K 

1 

K 


'l  1  tn  13  7  1 

-  +  -  cos(7jt/6)  +  -\ -  cos(7^/6) 


7  1  y/3  1 

- 1 - 1 - 

12  K  2  K 


3  _7_  J_  \/3 
4~l2  +  7T^”' 


~  0.409  meters. 


X 


Exercises  for  Section  8.1 


Exercise  8.1.1  An  object  moves  so  that  its  velocity  at  time  t  is  v(t)  —  —9.8 1  +  20  m/s.  Describe  the  motion 
of  the  object  between  t  —  0  and  t  —  5,  find  the  total  distance  traveled  by  the  object  during  that  time,  and 
find  the  net  distance  traveled. 

Exercise  8.1.2  An  object  moves  so  that  its  velocity  at  time  t  is  v(t)  —  sin/.  Set  up  and  evaluate  a  single 
definite  integral  to  compute  the  net  distance  traveled  between  t  —  0  and  t  =  2k. 

Exercise  8.1.3  An  object  moves  so  that  its  velocity  at  time  t  is  v(t )  =  1  +  2  sin/  m/s.  Find  the  net  distance 
traveled  by  the  object  between  l  —  0  and  t  =  2k,  and  find  the  total  distance  traveled  during  the  same 
period. 

Exercise  8.1.4  Consider  the  function  f(x)  =  (x  +  2)  (x  +  1 )  (x  —  1 )  (x  —  2)  on  [—2,2],  Find  the  total  area 
between  the  curve  and  the  x-axis  ( measuring  all  area  as  positive). 

Exercise  8.1.5  Consider  the  function  f{x)  —  x2  —  3x  +  2  on  [0,4].  Find  the  total  area  between  the  cur\>e 
and  the  x-axis  (measuring  all  area  as  positive). 

Exercise  8.1.6  Evaluate  the  three  integrals: 

r3  p4  r3 

A—  (— x2  +  9  )dx  (—  x2  +  9)dx  C—  (—x2  +  9  )dx, 

Jo  Jo  J  4 


and  verify  that  A  —  B  +  C. 
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8.2  Area  Between  Curves 


We  have  seen  how  integration  can  be  used  to  find  an  area  between  a  curve  and  the  x-axis.  With  very  little 
change  we  can  find  some  areas  between  curves;  indeed,  the  area  between  a  curve  and  the  x-axis  may  be 
interpreted  as  the  area  between  the  curve  and  a  second  “curve”  with  equation  y  —  0. 

Suppose  we  would  like  to  find  the  area  below  /(x)  —  —  x2  +  4x  +  3  and  above  g(x)  =  — x3  +  lx2  — 
10x  +  5  over  the  interval  1  <  x  <  2.  We  can  approximate  the  area  between  two  curves  by  dividing  the  area 
into  thin  sections  and  approximating  the  area  of  each  section  by  a  rectangle,  as  indicated  in  figure  8.1.  The 
area  of  a  typical  rectangle  is  Ax(/(x;)  —  g(x,)),  so  the  total  area  is  approximately 


n—  1 


E  (/(*i) -£(*«)) A*- 

(=0 


This  is  exactly  the  sort  of  sum  that  turns  into  an  integral  in  the  limit,  namely  the  integral 


Then 


— x2  T  4x  T  3 )  —  ( — x3  T  lx' " 


10x  +  5)  dx 


49 

12' 


10 


5 


0 


0 


2 


3 


Figure  8.1:  Approximating  area  between  curves  with  rectangles. 


This  procedure  can  informally  be  thought  of  as  follows. 


Area  Between  Two  Curves 

Area  —  j 

fh 

'  (top  curve)  —  (bottom  curve)  dx,  a<x<b. 

a 
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More  formally,  the  area  A  of  the  region  bounded  by  the  curves  y  —  f(x)  and  y  =  g(x)  and  the  lines 
x  —  a  and  x  —  b  is: 

A=  [  \f(x)~g(x)\dx. 


Example  8.3:  Area  between  Curves 


Find  the  area  between  f  (x)  —  —x2  +  4x  and  g(x)  —  x2  -  6x  +  5 ;  the  curves  are  shown  in  figure  8.2. 


Solution.  Here  we  are  not  given  a  specific  interval,  so  it  must  be  the  case  that  there  is  a  “natural”  region 
involved.  Since  the  curves  are  both  parabolas,  the  only  reasonable  interpretation  is  the  region  between  the 
two  intersection  points,  which  can  be  computed  as: 

5  ±  \/f5 

2  ' 


If  we  let  a  —  (5  —  \/l5)/2  and  b  =  (5  +  a/15)/2,  the  total  area  is 


f  —  x2  +  4x—  (x2  —  6x-\-5)dx  —  f 
J  a  J a 


— 2xl  4-  \0x  —  5dx 
b 


2x3  2 

— —  +  5x~  —  5x 


=  5VT5. 


after  a  bit  of  simplification. 


X 
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Some  general  guidelines  to  compute  the  area  between  two  curves  follows. 


Guidelines  for  Area  Between  Two  Curves 


1.  Find  the  intersection  points. 

2.  Draw  a  sketch  of  the  two  curves. 

3.  Using  the  sketch  determine  which  curve  is  the  top  curve  and  which  curve  is  the  bottom  curve. 
You  may  need  to  split  the  area  up  into  multiple  regions  if  the  curves  intersect  multiple  times 
in  [a,  b] . 

4.  Put  the  above  information  into  the  appropriate  formula  (once  for  each  region): 

rb 

Area  =  /  (top  curve)  —  (bottom  curve)  dx,  a<x<b. 

J a 

5.  Evaluate  the  integral  using  the  Fundamental  Theorem  of  Calculus  (you  should  get  a  positive 
number  representing  an  area). 


Solution.  The  points  of  intersection  of  y  —  x2  and  y  =  ^[x  are 

X2  —  yfx  — >  X4  —  X  X4—X  —  0  x(x3  —  1)=0. 
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Thus,  either  x—Oovx—  1.  Sketching  the  curves  gives: 


The  area  we  want  to  compute  is  the  shaded  region.  Since  the  top  curve  changes  at  x  =  1,  we  need  to  use 
the  formula  twice.  For  A\  we  have  a  =  0,  b  —  1,  the  top  curve  is  y  =  \fx  and  the  bottom  curve  is  v  =  x2. 
For  A2  we  have  a  =  1,  b  =  2,  the  top  curve  is  y  —  x2  and  the  bottom  curve  is  y  —  y/x. 


Area  =  Aj  +  A2 
For  the  first  integral  we  have: 


Thus, 


Area= M?3't3/2)|1 


10  — 4\/2 
3 


X 


Example  8.5:  Area  Between  Sine  and  Cosine 


Determine  the  area  enclosed  byy  —  sinv  and  y  =  cosx  on  the  interval  [0,  2k]. 


Solution.  The  curves  y  —  sinx  and  y  —  cosx  intersect  when: 

K 

sinx  =  cosx  — >  tanx  =1  — *  x  —  —  +  nk,  k  an  integer. 

We  have  the  following  sketch: 
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The  area  we  want  to  compute  is  the  shaded  region.  The  top  curve  changes  at  x  =  n/A  and  x  —  5k/4,  thus, 
we  need  to  split  the  area  up  into  three  regions:  from  0  to  tt/4;  from  k/4  to  5tt/4;  and  from  5tt/4  to  2k. 


Area 


(cosx  —  sinx)  dx  + 

7t/4 


,  5tt 
4 


n2n 

(sinx  —  cosx)  dx+  (cosx- 


(sinx  +  cosx) 


+  i 


cosx  —  sinx 


o 


5tt/4 


+  (sinx  +  cosx) 


71/4 


(y/2-  l)  +  (Vl  +  Vl)  +  (l  +  Vl) 

As/l 


sinx)  dx 

2  K 

5k/4 


X 


Sometimes  the  given 
following. 


curves  are  not  functions  of  x.  In  this  instances,  it  may  be  more  useful  to  use  the 


The  area  A  of  the  region  bounded  by  the  curves  x  =  f(y)  and  x  =  g(y)  and  the  lines  y  —  c  and  y  —  d  is: 

A  =  Jc  \f{y)-g(y)\dy. 

Informally  this  can  be  thought  of  as  follows: 


Area  Between  Two  Curves 

Area  =  j 

fd 

'  (right  curve)  —  (left  curve)  dy,  c  <y  <d. 

C 

Solution.  Note  that  x  —  y2  and  x  =  8  intersect  when: 

y2  =  8  -> 


y  =  ±v^8  —> 


y  —  ±2\/2 
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Sketching  the  two  curves  gives: 


From  the  sketch  c  =  —  2y/2,  d  =  2\fl,  the  right  curve  is  x  —  8  and  the  left  curve  is  x  —  y2. 

2V2 

-2V2 


rd  f2^2  9  /  1 

Area  =  /  [right  -  left]  dy  —  /  (8—  y2)dy  —  (  8 v  -  -y 

Jc  J— 2\f2  \  3 


8(2v/2)-^(2v/2)3  -  8(-2a/2)-^(-2a/2)2 


64\/2 


Exercises  for  Section  8.2 

Find  the  area  bounded  by  the  curves. 

Exercise  8.2.1  y  —  x  —x  and  y  =  x  ( the  part  to  the  right  of  the  y-axis) 

Exercise  8.2.2  x  —  y3  and  x  —  y2 
Exercise  8.2.3  x—  1  —  y2  and  y  —  —x  —  1 

Exercise  8.2.4  x  =  3 y  —  y2  and  x+y  —  3 

Exercise  8.2.5  y  —  cos(kx/2)  and  y  —  1  —  x2  (in  the  first  quadrant) 
Exercise  8.2.6  y  —  s i  n  (  tt.v / 3 )  and  y  —  x  (in  the  first  quadrant ) 

Exercise  8.2.7  y  —  y/x  and  y  —  x2 

Exercise  8.2.8  y  —  yfx  and  y  —  Vx+  1,  0  <x<  4 

Exercise  8.2.9  x  —  0  and  x  —  25—y2 

Exercise  8.2.10  y  —  sinxcos^:  and y  —  sinx,  0  <x<n 

Exercise  8.2.11  y  —  x3^2  and  y  —  x2^3 

Exercise  8.2.12  y  —  x2  —  2x  and  y  —  x  — 2 


X 
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8.3  Volume 


Now  that  we  have  seen  how  to  compute  certain  areas  by  using  integration;  we  will  now  look  into  how  some 
volumes  may  also  be  computed  by  evaluating  an  integral.  Generally,  the  volumes  that  we  can  compute 
this  way  have  cross-sections  that  are  easy  to  describe. 


Figure  8.3:  Volume  of  a  pyramid  approximated  by  rectangular  prisms. 


Example  8.7:  Volume  of  a  Pyramid 


Find  the  volume  of  a  pyramid  with  a  square  base  that  is  20  meters  tall  and  20  meters  on  a  side  at  the 
base. 


Solution.  As  with  most  of  our  applications  of  integration,  we  begin  by  asking  how  we  might  approximate 
the  volume.  Since  we  can  easily  compute  the  volume  of  a  rectangular  prism  (that  is,  a  “box”),  we  will 
use  some  boxes  to  approximate  the  volume  of  the  pyramid,  as  shown  in  figure  8.3:  On  the  left  is  a  cross- 
sectional  view,  on  the  right  is  a  3D  view  of  part  of  the  pyramid  with  some  of  the  boxes  used  to  approximate 
the  volume. 

Each  box  has  volume  of  the  form  (2jc;)  (2x,)Ay.  Unfortunately,  there  are  two  variables  here;  fortunately, 
we  can  write  x  in  terms  of  y:  From  the  cross-sectional  view  we  see  that  a  height  of  20  is  achieved  at  the 
midpoint  of  the  base.  We  will  also  position  the  cross-sectional  view  symmetrically  about  the  y-axis.  Thus 
at  x  —  0,  y  —  20,  and  we  have  a  slope  of  m  —  —2.  So 

y  —  —2x  +  b 
20  =  -2(0  )+b 
20  =  b. 
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Therefore,  y  —  20  —  2x,  and  in  the  terms  of  x:  x  —  10  —  y/2  or  Xi 
approximately 


52  4(10  —  y,-/2)2Ay 

i=0 

and  in  the  limit  we  get  the  volume  as  the  value  of  an  integral: 


10  —  y i/2.  Then  the  total  volume  is 


r20  r20 
/  4(10-  y/2)2  dy=  /  (20  -y)2dy  = 

Jo  Jo 

As  you  may  know,  the  volume  of  a  pyramid  is  (1/3)  (height)  (area  of  base) 
agrees  with  our  answer. 


203  _  8000 
3~  ~  I-' 

=  (1/3)  (20)  (400),  which 

* 


Example  8.8:  Volume  of  an  Object 


The  base  of  a  solid  is  the  region  between  fix')  —  x2  —  1  and  g(x)  —  —x2  +  1,  and  its  cross-sections 
perpendicular  to  the  x-axis  are  equilateral  triangles,  as  indicated  in  Figure  8.4.  The  solid  has  been 
truncated  to  show  a  triangular  cross-section  above  x—  1/2.  Find  the  volume  of  the  solid. 


Figure  8.4:  Solid  with  equilateral  triangles  as  cross-sections. 


Solution.  A  cross-section  at  a  value  x;  on  the  x-axis  is  a  triangle  with  base  2(1  —  xf)  and  height  V3  ( 1  —  xj ) , 
so  the  area  of  the  cross-section  is 

\  (base)  (height)  =  (1  -x?)\/3(l -xf), 
and  the  volume  of  a  thin  “slab”  is  then 

(1  —xf)V 3(1  —xf)  Ax. 


x/3(  1  —x2)2dx  — 


16 

15 


y/3. 


Thus  the  total  volume  is 
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X 

One  easy  way  to  get  “nice”  cross-sections  is  by  rotating  a  plane  figure  around  a  line.  For  example,  in 
Figure  8.5  we  see  a  plane  region  under  a  curve  and  between  two  vertical  lines;  then  the  result  of  rotating 
this  around  the  x-axis,  and  a  typical  circular  cross-section  is  a  circle. 


Figure  8.5:  A  solid  of  rotation. 


Of  course  a  real  “slice”  of  this  figure  will  not  have  straight  sides,  but  we  can  approximate  the  volume 
of  the  slice  by  a  cylinder  or  disk  with  circular  top  and  bottom  and  straight  sides;  the  volume  of  this  disk 
will  have  the  form  nr  Ax.  As  long  as  we  can  write  r  in  terms  of  x  we  can  compute  the  volume  by  an 
integral. 


Example  8.9:  Volume  of  a  Right  Circular  Cone 


Find  the  volume  of  a  right  circular  cone  with  base  radius  10  and  height  20.  (A  right  circular  cone  is 
one  with  a  circular  base  and  with  the  tip  of  the  cone  directly  over  the  center  of  the  base.) 


Solution.  We  can  view  this  cone  as  produced  by  the  rotation  of  the  line  y  —  x/2  rotated  about  the  x-axis, 
as  indicated  in  figure  8.6. 


Figure  8.6:  A  region  that  generates  a  cone;  approximating  the  volume  by  circular  disks. 


At  a  particular  point  on  the  x-axis,  say  x;,  the  radius  of  the  resulting  cone  is  the  y-coordinate  of  the 


8.3.  Volume  315 


corresponding  point  on  the  line,  namely  >7  =  x-,/2.  Thus  the  total  volume  is  approximately 


n—  1 

^  n(xi/2)2 dx 
i= 0 


and  the  exact  volume  is 


c2 0  x2 
n—dx  — 


K  203  20007T 


Jo  4  4  3  3 

Note  that  we  can  instead  do  the  calculation  with  a  generic  height  and  radius: 


fh  y2 

K—^x1  dx  = 


Kr~  h2  nr2h 


JO  'h2  h2  3  3 

giving  us  the  usual  formula  for  the  volume  of  a  cone. 


X 


Example  8.10:  Volume  of  an  Object  with  a  Hole 


Find  the  volume  of  the  object  generated  when  the  area  between  y  —  x2  and  y  —  x  is  rotated  around 
the  x-axis. 


Solution.  This  solid  has  a  “hole”  in  the  middle;  we  can  compute  the  volume  by  subtracting  the  volume 
of  the  hole  from  the  volume  enclosed  by  the  outer  surface  of  the  solid.  In  figure  8.7  we  show  the  region 
that  is  rotated,  the  resulting  solid  with  the  front  half  cut  away,  the  cone  that  forms  the  outer  surface,  the 
horn-shaped  hole,  and  a  cross-section  perpendicular  to  the  x-axis. 
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Figure  8.7:  Solid  with  a  hole,  showing  the  outer  cone  and  the  shape  to  be  removed  to  form  the  hole. 


We  have  already  computed  the  volume  of  a  cone;  in  this  case  it  is  nj 3.  At  a  particular  value  of  x,  say 
X;,  the  cross-section  of  the  horn  is  a  circle  with  radius  xj,  so  the  volume  of  the  horn  is 

/  7i(x2)2dx=  f  7ix4dx  —  7i~, 

Jo  Jo  5 

so  the  desired  volume  is  k/3  —  tz/5  —  2tt / 15 . 

As  with  the  area  between  curves,  there  is  an  alternate  approach  that  computes  the  desired  volume  “all 
at  once”  by  approximating  the  volume  of  the  actual  solid.  We  can  approximate  the  volume  of  a  slice  of 
the  solid  with  a  washer-shaped  volume,  as  indicated  in  Figure  8.7. 

The  volume  of  such  a  washer  is  the  area  of  the  face  times  the  thickness.  The  thickness,  as  usual,  is  Ax, 

9  9 

while  the  area  of  the  face  is  the  area  of  the  outer  circle  minus  the  area  of  the  inner  circle,  say  kR~  —  nr  ,  or 
7r(TOP)2  —  ^(BOTTOM)2.  In  the  present  example,  at  a  particular  x,,  the  radius  R  (The  “TOP”  function) 
is  xi  and  r  (The  “BOTTOM”  function)  xj.  Hence,  the  whole  volume  is 


l 


71  (TOP2  -  BOTTOM2)  dx 


^  4  i 

TlX  —  TlX  dx  —  71 


2k 

15' 


Of  course,  what  we  have  done  here  is  exactly  the  same  calculation  as  before,  except  we  have  in  effect 
recomputed  the  volume  of  the  outer  cone.  X 
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Suppose  the  region  between  /(x)  =  x+  1  and  g(x)  =  (x—  l)2  is  rotated  around  the  y-axis;  see  Fig¬ 
ure  8.8.  It  is  possible,  but  inconvenient,  to  compute  the  volume  of  the  resulting  solid  by  the  method  we 
have  used  so  far.  The  problem  is  that  there  are  two  “kinds”  of  typical  rectangles:  Those  that  go  from  the 
line  to  the  parabola  and  those  that  touch  the  parabola  on  both  ends.  To  compute  the  volume  using  this 
approach,  we  need  to  break  the  problem  into  two  parts  and  compute  two  integrals: 

71  L  (\  +  y/y)2 -(y-l)2dy=  ^7T  +  ^7T  =  yTT. 


If  instead  we  consider  a  typical  vertical  rectangle,  but  still  rotate  around  the  y-axis,  we  get  a  thin  “shell” 
instead  of  a  thin  “washer”.  Note  that  “washers”  are  related  to  the  area  of  a  circle,  nr2,  whereas  “shells” 
are  related  to  the  surface  area  of  a  cylinder,  2nrh.  If  we  add  up  the  volume  of  such  thin  shells  we  will  get 
an  approximation  to  the  true  volume.  What  is  the  volume  of  such  a  shell?  Consider  the  shell  at  Xj.  Imagine 
that  we  cut  the  shell  vertically  in  one  place  and  “unroll”  it  into  a  thin,  flat  sheet,  namely  the  surface  of  a 
cylinder.  This  sheet  will  be  almost  a  rectangular  prism  that  is  Ax  thick,  /(x,-)  —  g(x,-)  (TOP— BOTTOM) 
tall,  and  2nx,-  wide.  The  volume  will  then  be  approximately  the  volume  of  a  rectangular  prism  with  these 
dimensions:  27Cx!(/(x;)  —  g(xi))Ax.  If  we  add  these  up  and  take  the  limit  as  usual,  we  get  the  integral 


r  3  r  3  r3  'YJ 

/  2 nx{f{x)—g{x))dx—  /  2nx (TOP  —  BOTTOM)  dx  =  /  2nx{x+  1  —  (x  —  l)2)dx  — — n. 

Jo  Jo  Jo  2 

Not  only  does  this  accomplish  the  task  with  only  one  integral,  the  integral  is  somewhat  easier  than  those  in 
the  previous  calculation.  Things  are  not  always  so  neat,  but  it  is  often  the  case  that  one  of  the  two  methods 
will  be  simpler  than  the  other,  so  it  is  worth  considering  both  before  starting  to  do  calculations. 


Example  8.11: 


Suppose  the  area  under  y  —  — x2  +  1  between  x  —  0  and  x  —  1  is  rotated  around  the  x-axis.  Find  the 
volume  by  both  methods. 


Solution,  Using  the  disk  method  we  obtain: 

r  1  o  o  g 

/  7r(l  —jc)2dx=  —  n. 
Jo  15 
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Using  the  shell  method  we  obtain: 


/  2ny  \f\  -ydy  =  ^-n. 
Jo  15 


X 


Exercises  for  8.3 


/■'  0  „  /4  0  _  8  65  27 

Exercise 8.3.1  Verify  that  Tt  (\  + JyY  —  — JyY dy  +  7i  \  {\-\-JyY  —  (y—l)-  —  -7i  +  —7t——iz. 

Jo  ’  Ji  ’  362 

f3  t  27 

Exercise  8.3.2  Ver//V  tfrat  /  2ttx(x  +  1  —  (x  —  l)2)  dx  =  —  it. 

Jo  2 

8 

Exercise  8.3.3  UrZ/y  that  /  7r(  1  —x~)~dx  —  —K. 

Jo  15 

Exercise  8.3.4  Verify  that  [  lizyJX  —ydy  — 

Jo  '  15 


Exercise  8.3.5  Use  integration  to  find  the  volume  of  the  solid  obtained  by  revolving  the  region  bounded 
by  x  +  y  —  2  and  the  x  and  y  axes  around  the  x-axis. 


Exercise  8.3.6  Find  the  volume  of  the  solid  obtained  by  revolving  the  region  bounded  by  y  —  x  —  x  and 
the  x-axis  around  the  x-axis. 


Exercise  8.3.7  Find  the  volume  of  the  solid  obtained  by  revolving  the  region  bounded  by  y  =  y/ sinx 
between  x  —  0  and  x—Tz/2,  the  y-axis,  and  the  line  y  =  1  around  the  x-axis. 

Exercise  8.3.8  Let  S  be  the  region  of  the  xy-plane  bounded  above  by  the  cur\>e  x  y  —  64,  below  by  the  line 
y  on  the  left  by  the  line  x  —  2,  and  on  the  right  by  the  line  x  —  4.  Find  the  volume  of  the  solid  obtained 
by  rotating  S  around: 

(a)  the  x-axis;  (c)  the  y-axis;  and 

(b)  the  line  y  =  1 ;  (d)  the  line  x—2. 


Exercise  8.3.9  The  equation  x2 /9-t-v2 /4  =  1  describes  an  ellipse.  Find  the  volume  of  the  solid  obtained 
by  rotating  the  ellipse  around  the  x-axis  and  also  around  the  y-axis.  These  solids  are  called  ellipsoids;  one 
is  vaguely  rugby-ball  shaped,  one  is  sort  of  flying-saucer  shaped,  or  perhaps  squished-beach-ball-shaped. 
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Figure  8.9:  Ellipsoids. 


Exercise  8.3.10  Use  integration  to  compute  the  volume  of  a  sphere  of  radius  r.  You  should  of  course  get 
the  well-known  formula  4nr/3. 

Exercise  8.3.11  A  hemispheric  bowl  of  radius  r  contains  water  to  a  depth  h.  Find  the  volume  of  water  in 
the  bowl. 

Exercise  8.3.12  The  base  of  a  tetrahedron  (a  triangular  pyramid)  of  height  h  is  an  equilateral  tri¬ 
angle  of  side  s.  Its  cross-sections  perpendicular  to  an  altitude  are  equilateral  triangles.  Express  its 
volume  V  as  an  integral,  and  find  a  formula  for  V  in  terms  of  h  and  s.  Verify  that  your  answer  is 
(1/3) (area  ofbase)(height). 

Exercise  8.3.13  The  base  of  a  solid  is  the  region  between  f(x)  —  cosx  and  g(x)  —  —  cosx,  —  k/2  <  x  < 
K /  2,  and  its  cross-sections  perpendicular  to  the  x-axis  are  squares.  Find  the  volume  of  the  solid. 


8.4  Average  Value  of  a  Function 


The  average  of  some  finite  set  of  values  is  a  familiar  concept.  If,  for  example,  the  class  scores  on  a  quiz 
are  10,  9,  10,  8,  7,  5,  7,  6,  3,  2,  7,  8,  then  the  average  score  is  the  sum  of  these  numbers  divided  by  the  size 
of  the  class: 

10  +  9+10  +  8  +  7  +  5  +  7  +  6  +  3  +  2  +  7  +  8  82  ^  oo 

average  score  = - - - =  —  ~  6.83. 

&  12  12 

Suppose  that  between  t  —  0  and  t  —  1  the  speed  of  an  object  is  sin(TTt).  What  is  the  average  speed  of  the 

object  over  that  time?  The  question  sounds  as  if  it  must  make  sense,  yet  we  can’t  merely  add  up  some 

number  of  speeds  and  divide,  since  the  speed  is  changing  continuously  over  the  time  interval. 

To  make  sense  of  “average”  in  this  context,  we  fall  back  on  the  idea  of  approximation.  Consider  the 

speed  of  the  object  at  tenth  of  a  second  intervals:  sinO,  sin(0. 1  tt)  ,  sin(0.27r),  sin(0.37r),. . . ,  sin(0.97r).  The 

average  speed  “should”  be  fairly  close  to  the  average  of  these  ten  speeds: 

^  Esin(^/10)  «  ^6.3  =  0.63. 

Of  course,  if  we  compute  more  speeds  at  more  times,  the  average  of  these  speeds  should  be  closer  to  the 
“real”  average.  If  we  take  the  average  of  n  speeds  at  evenly  spaced  times,  we  get: 


1 

n 


n—  1 


i 


sin  (7 li/n). 
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Here  the  individual  times  are  f;  =  i/n,  so  rewriting  slightly  we  have 

^  n—  1 

-  £  sin(7Wf). 

7=0 

This  is  almost  the  sort  of  sum  that  we  know  turns  into  an  integral;  what’s  apparently  missing  is  At — but  in 
fact,  At  —  l/n,  the  length  of  each  subinterval.  So  rewriting  again: 

77—1  J  77—1 

^  sin (nti)-  —  ^  sm(nti)At. 

7=0  n  7=0 


Now  this  has  exactly  the  right  form,  so  that  in  the  limit  we  get 

l 


average  speed  —  f'sin(m)d,= 

Jo  71 


cos(7r)  cos(0) 


0 


K 


K 


—  ~  0.6366  ~  0.64. 

n 


It’s  not  entirely  obvious  from  this  one  simple  example  how  to  compute  such  an  average  in  general. 
Let’s  look  at  a  somewhat  more  complicated  case.  Suppose  that  the  velocity  of  an  object  is  16t1 2  +  5  feet 
per  second.  What  is  the  average  velocity  between  t  —  1  and  t  =  31  Again  we  set  up  an  approximation  to 
the  average: 


1 

n 


n—  1 


i 


16/f  +  5, 


where  the  values  tt  are  evenly  spaced  times  between  1  and  3.  Once  again  we  are  “missing”  At,  and  this 
time  l/n  is  not  the  correct  value.  What  is  At  in  general?  It  is  the  length  of  a  subinterval;  in  this  case  we 
take  the  interval  [1,3]  and  divide  it  into  n  subintervals,  so  each  has  length  (3  —  l)/n  =  2/n  =  At.  Now  with 
the  usual  “multiply  and  divide  by  the  same  thing”  trick  we  can  rewrite  the  sum: 


1 

n 


n—  1 


i 


16/f +  5 


1 

3^1 


"eW  +  5) 


7=0 


3-1 

n 


1  77  1  9 

2.L(16,'2  +  5)- 


W(16<,2  +  5)A7. 

Z  7=0 


In  the  limit  this  becomes 


1  /*  2  c  j  1446  223 

2  Ji  2  3  3 


Does  this  seem  reasonable?  Let’s  picture  it:  In  Figure  8.10  we  see  the  velocity  function  together  with  the 
horizontal  line  y  =  223 /3  ~  74.3.  Certainly  the  height  of  the  horizontal  line  looks  at  least  plausible  for  the 
average  height  of  the  curve. 
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Figure  8.10:  Average  velocity. 


Here’s  another  way  to  interpret  “average”  that  may  make  our  computation  appear  even  more  reason¬ 
able.  The  object  of  our  example  goes  a  certain  distance  between  t  —  1  and  1  —  3.  If  instead  the  object  were 
to  travel  at  the  average  speed  over  the  same  time,  it  should  go  the  same  distance.  At  an  average  speed  of 
223/3  feet  per  second  for  two  seconds  the  object  would  go  446/3  feet.  How  far  does  it  actually  go?  We 
know  how  to  compute  this: 

/'3  r 3  44g 

j  v(t)dt  —  I  16r  +  5  dt  —  — . 

So  now  we  see  that  another  interpretation  of  the  calculation  is: 


1 

-  /  \6t2  +  5dt 

2 ./ 1 


1446 
2  ~ 3~ 


223 

^T 


which  is  the  total  distance  traveled  divided  by  the  time  in  transit,  namely,  the  usual  interpretation  of  average 
speed. 


In  the  case  of  speed,  or  more  properly  velocity,  we  can  always  interpret  “average”  as  total  (net)  dis¬ 
tance  divided  by  time.  However,  in  the  case  of  a  different  sort  of  quantity  this  interpretation  does  not 
obviously  apply,  while  the  approximation  approach  always  does.  We  might  interpret  the  same  problem 
geometrically:  What  is  the  average  height  of  16v2 +  5  on  the  interval  [1,3]?  We  approximate  this  in  exactly 
the  same  way,  by  adding  up  many  sample  heights  and  dividing  by  the  number  of  samples.  In  the  limit  we 
get  the  same  result: 


1 

2. 


16x  +5dx 


1446 

2~3~ 


223 

IT' 


We  can  interpret  this  result  in  a  slightly  different  way.  The  area  under  y  =  I6x2  +  5  above  [1,3]  is 


16 1  -|-  5  dt 


446 

IT' 


The  area  under  y  —  223 / 3  over  the  same  interval  [1,3]  is  simply  the  area  of  a  rectangle  that  is  2  by  223 /  3 
with  area  446/3.  So  the  average  height  of  a  function  is  the  height  of  the  horizontal  line  that  produces  the 
same  area  over  the  given  interval. 
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Exercises  for  8.4 


Exercise  8.4.1  Find  the  average  height  of  cos.*  over  the  intervals  [0,7r/2],  [— tt/2,  n/2],  and  [0, 2n\. 
Exercise  8.4.2  Find  the  average  height  ofx 2  over  the  internal  [—2,2]. 

Exercise  8.4.3  Find  the  average  height  of  1  /Or  over  the  interval  [1,A]. 

Exercise  8.4.4  Find  the  average  height  of  \f\-x1  over  the  interx’al  [—1,1]. 

Exercise  8.4.5  An  object  moves  with  velocity  v(t)  —  —t2  +  1  feet  per  second  between  t  —  0  and  t  —  2. 
Find  the  average  velocity  and  the  average  speed  of  the  object  between  t  =  0  and  t  —  2. 


Exercise  8.4.6  The  observation  deck  on  the  102nd  floor  of  the  Empire  State  Building  is  1,224  feet  above 
the  ground.  If  a  steel  ball  is  dropped  from  the  observation  deck  its  velocity  at  time  t  is  approximately 
v(l  )  —  —32 1  feet  per  second.  Find  the  average  speed  between  the  time  it  is  dropped  and  the  time  it  hits  the 
ground,  and  find  its  speed  when  it  hits  the  ground. 


8.5  Work 


A  fundamental  concept  in  classical  physics  is  work:  If  an  object  is  moved  in  a  straight  line  against  a  force 
F  for  a  distance  d  the  work  done  is  W  —  Fd. 


Example  8.12:  Constant  Force 


Flow  much  work  is  done  in  lifting  a  10  pound  weight  vertically  a  distance  of  5  feet? 


Solution.  The  force  due  to  gravity  on  a  10  pound  weight  is  10  pounds  at  the  surface  of  the  earth,  and  it 
does  not  change  appreciably  over  5  feet.  The  work  done  is  W  =  10  •  5  —  50  foot-pounds.  & 

In  reality  few  situations  are  so  simple.  The  force  might  not  be  constant  over  the  range  of  motion,  as  in 
the  next  example. 


Example  8.13:  Lifting  a  Weight 


How  much  work  is  done  in  lifting  a  10  pound  weight  from  the  surface  of  the  earth  to  an  orbit  100 
miles  above  the  surface? 


Solution.  Over  100  miles  the  force  due  to  gravity  does  change  significantly,  so  we  need  to  take  this  into 
account.  The  force  exerted  on  a  10  pound  weight  at  a  distance  r  from  the  center  of  the  earth  is  F  =  k/r2 
and  by  definition  it  is  10  when  r  is  the  radius  of  the  earth  (we  assume  the  earth  is  a  sphere).  How  can  we 
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approximate  the  work  done?  We  divide  the  path  from  the  surface  to  orbit  into  n  small  subpaths.  On  each 
subpath  the  force  due  to  gravity  is  roughly  constant,  with  value  k/rf  at  distance  r,.  The  work  to  raise  the 
object  from  r,  to  rl+\  is  thus  approximately  kj rjAr  and  the  total  work  is  approximately 


n-\  7 

E4A7, 


i= 0  ri 


or  in  the  limit 

n  k 

W  —  dr , 

Jr0  r 

where  ro  is  the  radius  of  the  earth  and  r\  is  ro  plus  100  miles.  The  work  is 

k  k 

- I - • 

r\  r0 

Using  ro  =  20925525  feet  we  have  r\  —  21453525.  The  force  on  the  10  pound  weight  at  the  surface  of  the 
earth  is  10  pounds,  so  10  =  kj 209255252,  giving  k  —  4378775965256250.  Then 


k  k 

- 1 - 

n  ro 


491052320000 

95349 


« 5150052 


foot-pounds. 


Note  that  if  we  assume  the  force  due  to  gravity  is  10  pounds  over  the  whole  distance  we  would  calculate 
the  work  as  10(ri  —  ro)  =  10  ■  100  ■  5280  =  5280000,  somewhat  higher  since  we  don’t  account  for  the 
weakening  of  the  gravitational  force.  & 


Example  8.14:  Lifting  an  Object 


How  much  work  is  done  in  lifting  a  10  kilogram  object  from  the  surface  of  the  earth  to  a  distance  D 
from  the  center  of  the  earth? 


Solution.  This  is  the  same  problem  as  before  in  different  units,  and  we  are  not  specifying  a  value  for  D. 
As  before 


W 


k  k 

D  +  7o 


While  “weight  in  pounds”  is  a  measure  of  force,  “weight  in  kilograms”  is  a  measure  of  mass.  To  convert 
to  force  we  need  to  use  Newton’s  law  F  =  ma.  At  the  surface  of  the  earth  the  acceleration  due  to  gravity 
is  approximately  9.8  meters  per  second  squared,  so  the  force  is  F  —  10  •  9.8  =  98.  The  units  here  are 
“kilogram-meters  per  second  squared”  or  “kg  m/s2”,  also  known  as  a  Newton  (N),  so  F  =  98  N.  The 
radius  of  the  earth  is  approximately  6378.1  kilometers  or  6378100  meters.  Now  the  problem  proceeds  as 
before.  From  F  —  k/r2  we  compute  k:  98  =  k/63781002,  k  =  3.986655642  •  1015.  Then  the  work  is: 


W  =  —  ^  +  6.250538000  •  108  Newton-meters. 

As  D  increases  W  of  course  gets  larger,  since  the  quantity  being  subtracted,  —k/D,  gets  smaller.  But  note 

o 

that  the  work  W  will  never  exceed  6.250538000  •  10  ,  and  in  fact  will  approach  this  value  as  D  gets  larger. 

Q 

In  short,  with  a  finite  amount  of  work,  namely  6.250538000  ■  10  N-m,  we  can  lift  the  10  kilogram  object 
as  far  as  we  wish  from  earth.  £ 
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Next  is  an  example  in  which  the  force  is  constant,  but  there  are  many  objects  moving  different  dis¬ 
tances. 


Example  8.15:  Multiple  Objects  Moving 


Suppose  that  a  water  tank  is  shaped  like  a  right  circular  cone  with  the  tip  at  the  bottom,  and  has 
height  10  meters  and  radius  2  meters  at  the  top.  If  the  tank  is  full,  how  much  work  is  required  to 
pump  all  the  water  out  over  the  top? 


Solution.  Here  we  have  a  large  number  of  atoms  of  water  that  must  be  lifted  different  distances  to  get  to 
the  top  of  the  tank.  Fortunately,  we  don’t  really  have  to  deal  with  individual  atoms — we  can  consider  all 
the  atoms  at  a  given  depth  together. 

To  approximate  the  work,  we  can  divide  the  water  in  the  tank  into  horizontal  sections,  approximate  the 
volume  of  water  in  a  section  by  a  thin  disk,  and  compute  the  amount  of  work  required  to  lift  each  disk  to 
the  top  of  the  tank.  As  usual,  we  take  the  limit  as  the  sections  get  thinner  and  thinner  to  get  the  total  work. 


<-2-> 


Figure  8.11:  Cross-section  of  a  conical  water  tank. 


At  depth  h  the  circular  cross-section  through  the  tank  has  radius  r  —  ( 10  —  h)  / 5 ,  by  similar  triangles, 
and  area  ;r(10  — fi)2/25.  A  section  of  the  tank  at  depth  h  thus  has  volume  approximately  7r(  10  —  h)2  /  25  Ah 
and  so  contains  ok(  1 0  —  h)2  /  25  Ah  kilograms  of  water,  where  <7  is  the  density  of  water  in  kilograms  per 
cubic  meter;  a  «  1000.  The  force  due  to  gravity  on  this  much  water  is  9.8<77t(  10  —  h)2/25Ah,  and  finally, 
this  section  of  water  must  be  lifted  a  distance  h,  which  requires  /? 9 . 8 cr 7T  ( 1 0  —  /i ) 2 / 2 5  A/ ?  Newton-meters  of 
work.  The  total  work  is  therefore 

9.8(77T  r10  ,  980000 

W  —  — - -  /  fi(10  —  h)1  dh  =  — - — k  ~  1026254  Newton-meters. 

25  Jo  3 


* 


A  spring  has  a  “natural  length,”  its  length  if  nothing  is  stretching  or  compressing  it.  If  the  spring 
is  either  stretched  or  compressed  the  spring  provides  an  opposing  force;  according  to  Hooke’s  Law  the 
magnitude  of  this  force  is  proportional  to  the  distance  the  spring  has  been  stretched  or  compressed:  F  =  kx. 
The  constant  of  proportionality,  k,  of  course  depends  on  the  spring.  Note  that  x  here  represents  the  change 
in  length  from  the  natural  length. 
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Example  8.16:  Compressing  a  Spring 


Suppose  k  —  5  for  a  given  spring  that  has  a  natural  length  of  0. 1  meters.  Suppose  a  force  is  applied 
that  compresses  the  spring  to  length  0.08.  What  is  the  magnitude  of  the  force? 


Solution.  Assuming  that  the  constant  k  has  appropriate  dimensions  (namely,  kg/s2),  the  force  is  5(0.1  — 
0.08)  =  5(0.02)  =  0.1  Newtons.  £ 


Example  8.17:  Compressing  a  Spring  (continued) 


How  much  work  is  done  in  compressing  the  spring  in  the  previous  example  from  its  natural  length 
to  0.08  meters?  From  0.08  meters  to  0.05  meters?  How  much  work  is  done  to  stretch  the  spring 
from  0.1  meters  to  0.15  meters? 


Solution.  We  can  approximate  the  work  by  dividing  the  distance  that  the  spring  is  compressed  (or  stretched) 
into  small  subintervals.  Then  the  force  exerted  by  the  spring  is  approximately  constant  over  the  subinter¬ 
val,  so  the  work  required  to  compress  the  spring  from ; q  to  x;+  \  is  approximately  5  (x,  —  0.1)  At.  The  total 
work  is  approximately 

n—  1 

^  5(x,-  —  0.1)  At 
i= 0 

and  in  the  limit 


rO.o  8  5(x  — 0  1)2 

W=  /  5(x-0.1)rfx  =  '  ' 

.70.1  2 


0.08 


0.1 


5(0.08  —  0.1)2  5(0.1 -0.1)2  1 


1000 


N-m. 


The  other  values  we  seek  simply  use  different  limits.  To  compress  the  spring  from  0.08  meters  to  0.05 
meters  takes 


W  = 


r0.05 

/  5(x  —  0.1)<fx  = 

J  0.08 


5(x  — 0.1) 


2  0.05 


5(0.05  —  0.1)2  5(0.08  — 0.1)2  21 


0.08 


4000 


N-m 


and  to  stretch  the  spring  from  0.1  meters  to  0.15  meters  requires 


/•0.15 

W=  /  5(x  —  0.1)  Jx  = 

70.1 


5(x-0.1)- 


5(0.15  —  0.1)2  5(0.1  -0.1)2  1 


o.i 


160 


N-m. 


Exercises  for  8.5 


Exercise  8.5.1  How  much  work  is  done  in  lifting  a  100  kilogram  weight  from  the  surface  of  the  earth  to 
an  orbit  35,786  kilometers  above  the  surface  of  the  earth? 
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Exercise  8.5.2  How  much  work  is  done  in  lifting  a  100  kilogram  weight  from  an  orbit  1000  kilometers 
above  the  surface  of  the  earth  to  an  orbit  35,786  kilometers  above  the  surface  of  the  earth? 

Exercise  8.5.3  A  water  tank  has  the  shape  of  an  upright  cylinder  with  radius  r  —  1  meter  and  height  10 
meters.  If  the  depth  of  the  water  is  5  meters,  how  much  work  is  required  to  pump  all  the  water  out  the  top 
of  the  tank? 

Exercise  8.5.4  Suppose  the  tank  of  the  previous  problem  is  lying  on  its  side,  so  that  the  circular  ends  are 
vertical,  and  that  it  has  the  same  amount  of  water  as  before.  How  much  work  is  required  to  pump  the 
water  out  the  top  of  the  tank  (which  is  now  2  meters  above  the  bottom  of  the  tank)? 

Exercise  8.5.5  A  water  tank  has  the  shape  of  the  bottom  half  of  a  sphere  with  radius  r  —  1  meter.  If  the 
tank  is  full,  how  much  work  is  required  to  pump  all  the  water  out  the  top  of  the  tank? 

Exercise  8.5.6  A  spring  has  constant  k—  10  kg/s2.  How  much  work  is  done  in  compressing  it  1/10  meter 
from  its  natural  length? 

Exercise  8.5.7  A  force  of  2  Newtons  will  compress  a  spring  from  1  meter  (its  natural  length )  to  0.8  meters. 
How  much  work  is  required  to  stretch  the  spring  from  1.1  meters  to  1.5  meters? 

Exercise  8.5.8  A  20  meter  long  steel  cable  has  density  2  kilograms  per  meter,  and  is  hanging  straight 
down.  How  much  work  is  required  to  lift  the  entire  cable  to  the  height  of  its  top  end? 

Exercise  8.5.9  The  cable  in  the  previous  problem  has  a  100  kilogram  bucket  of  concrete  attached  to  its 
lower  end.  How  much  work  is  required  to  lift  the  entire  cable  and  bucket  to  the  height  of  its  top  end? 

Exercise  8.5.10  Consider  again  the  cable  and  bucket  of  the  previous  problem.  How  much  work  is  required 
to  lift  the  bucket  10  meters  by  raising  the  cable  10  meters?  (The  top  half  of  the  cable  ends  up  at  the  height 
of  the  top  end  of  the  cable,  while  the  bottom  half  of  the  cable  is  lifted  10  meters.) 
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Suppose  a  beam  is  10  meters  long,  and  that  there  are  three  weights  on  the  beam:  a  10  kilogram  weight  3 
meters  from  the  left  end,  a  5  kilogram  weight  6  meters  from  the  left  end,  and  a  4  kilogram  weight  8  meters 
from  the  left  end.  Where  should  a  fulcrum  be  placed  so  that  the  beam  balances?  Let’s  assign  a  scale  to 
the  beam,  from  0  at  the  left  end  to  10  at  the  right,  so  that  we  can  denote  locations  on  the  beam  simply  as  x 
coordinates;  the  weights  are  atx  —  3,x  —  6,  and  x  —  8,  as  in  Figure  8.12. 
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Figure  8.12:  A  beam  with  three  masses. 
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Suppose  to  begin  with  that  the  fulcrum  is  placed  at  x  —  5.  What  will  happen?  Each  weight  applies  a 
force  to  the  beam  that  tends  to  rotate  it  around  the  fulcrum;  this  effect  is  measured  by  a  quantity  called 
torque,  proportional  to  the  mass  times  the  distance  from  the  fulcrum.  Of  course,  weights  on  different 
sides  of  the  fulcrum  rotate  the  beam  in  opposite  directions.  We  can  distinguish  this  by  using  a  signed 
distance  in  the  formula  for  torque.  So  with  the  fulcrum  at  5,  the  torques  induced  by  the  three  weights  will 
be  proportional  to  (3  —  5)10  =  —20,  (6  —  5)5  =  5,  and  (8  —  5)4  =  12.  For  the  beam  to  balance,  the  sum  of 
the  torques  must  be  zero;  since  the  sum  is— 20  +  5  +  12  =  — 3,  the  beam  rotates  counter-clockwise,  and  to 
get  the  beam  to  balance  we  need  to  move  the  fulcrum  to  the  left.  To  calculate  exactly  where  the  fulcrum 
should  be,  we  let  x  denote  the  location  of  the  fulcrum  when  the  beam  is  in  balance.  The  total  torque  on 
the  beam  is  then  (3  —  x)  10  +  (6  —  x)5  +  (8  —  x)4  =  92  —  19x.  Since  the  beam  balances  at  x  it  must  be  that 
92  —  19x  =  0  or  x  =  92/19  ~  4.84,  that  is,  the  fulcrum  should  be  placed  at  x  =  92/19  to  balance  the  beam. 

Now  suppose  that  we  have  a  beam  with  varying  density — some  portions  of  the  beam  contain  more 
mass  than  other  portions  of  the  same  size.  We  want  to  figure  out  where  to  put  the  fulcrum  so  that  the  beam 
balances. 
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Figure  8.13:  A  solid  beam. 


Example  8.18:  Balance  Point  of  a  Beam 


Find  the  balance  point  of  a  solid  beam,  illustrated  in  Figure  8.13,  assuming  the  beam  is  10  meters 
long  and  that  the  density  is  1  +  x  kilograms  per  meter  at  location  x  on  the  beam. 


Solution.  To  approximate  the  solution,  we  can  think  of  the  beam  as  a  sequence  of  weights  “on”  a  beam. 
For  example,  we  can  think  of  the  portion  of  the  beam  between  x  —  0  and  x  =  1  as  a  weight  sitting  at  x  =  0, 
the  portion  between  x  —  1  and  x  =  2  as  a  weight  sitting  at  x  =  1,  and  so  on,  as  indicated  in  Figure  8.13.  We 
then  approximate  the  mass  of  the  weights  by  assuming  that  each  portion  of  the  beam  has  constant  density. 
So  the  mass  of  the  first  weight  is  approximately  mo  =  (1  +0)1  =  1  kilograms,  namely,  (1+0)  kilograms 
per  meter  times  1  meter.  The  second  weight  is/??i  =  (1  +  1)1  =  2  kilograms,  and  so  on  to  the  tenth  weight 
with  mg  =  (1+9)  1  =  10  kilograms.  So  in  this  case  the  total  torque  is 

(0  —  x)rao  +  (1  —x)m\  4 - 1-  (9  —  x)mg  —  (0  —  x)  1  +  (1  — x)2  -| - f  (9  —  x)10. 

If  we  set  this  to  zero  and  solve  for  x  we  get  x  =  6.  In  general,  if  we  divide  the  beam  into  n  portions,  the 
mass  of  weight  number  i  will  be  «?,■  =  (1  +  x,-)(x,-+i  —  x;)  =  (1  +  x,-)Ax  and  the  torque  induced  by  weight 
number  i  will  be  (x,  —  x)m,  =  (x,-  —  jc)  (1  +x,)Ax.  The  total  torque  is  then 

(xo  —  x) ( 1  +xo)Ax+  (xi  —  x)(l  +xi)Ax-t - b  (x„_i  -x)(l  +x„_i)Ar 

n— 1  n— 1 

=  ^  X,(l  +Xi) Ax  —  ^  x(l  +x;)Ax 
i= 0  (=0 

n— 1  n— 1 

=  ^  x,(l +x;-)Ar  — x  ^  (1 +x;-)Ar. 

i= 0  i=0 
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If  we  set  this  equal  to  zero  and  solve  for  x  we  get  an  approximation  to  the  balance  point  of  the  beam: 

n—  1  n—  1 

0  =  Y  *«'(!  +Xi)Ax-x  Y  (1  +xi)Ax 

i= 0  ;=0 

n—  1  n—  1 

X  ^  (1  +  Xj)Ax  =  X;(l  +Xi)Ax 
i= 0  i=0 

w— 1 

^Xf(l  +  x;)Ax 

7=0 

X  =  — - - . 

J^(l  +  X;)Ax 

7=0 


The  denominator  of  this  fraction  has  a  very  familiar  interpretation.  Consider  one  term  of  the  sum  in  the 
denominator:  (1  +  x;)  Ax.  This  is  the  density  near  xt  times  a  short  length,  Ax,  which  in  other  words  is 
approximately  the  mass  of  the  beam  between  x,  and  xi+  \ .  When  we  add  these  up  we  get  approximately 
the  mass  of  the  beam. 


Now  each  of  the  sums  in  the  fraction  has  the  right  form  to  turn  into  an  integral,  which  in  turn  gives  us 
the  exact  value  of  x: 


■  to 


x(l  +x)  dx 


X  = 


-10 


/  (l+x)dx 

Jo 

The  numerator  of  this  fraction  is  called  the  moment  of  the  system  around  zero: 


’10  fW  2 J  H50 

x(l+x)ax=  /  x  +  x  dx  — — - — , 


/  o 


and  the  denominator  is  the  mass  of  the  beam: 


r  10 

/  (l+x)dx  —  60, 
Jo 


and  the  balance  point,  officially  called  the  center  of  mass  is 


x  = 


1150  1 

I- 60 


115 

T8~ 


6.39. 


X 


It  should  be  apparent  that  there  was  nothing  special  about  the  density  function  <r(x)  =  1  +x  or  the 
length  of  the  beam,  or  even  that  the  left  end  of  the  beam  is  at  the  origin.  In  general,  if  the  density  of  the 
beam  is  <r(x)  and  the  beam  covers  the  interval  [a,b\,  the  moment  of  the  beam  around  zero  is 


rb 

Mo  =  /  xo(x)dx 


and  the  total  mass  of  the  beam  is 
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and  the  center  of  mass  is  at 


Example  8.19:  Center  of  Mass  of  a  Beam 


Suppose  a  beam  lies  on  the  x-axis  between  20  and  30,  and  has  density  function  <r(x)  =  x—  19.  Find 
the  center  of  mass. 


Solution.  This  is  the  same  as  the  previous  example  except  that  the  beam  has  been  moved.  Note  that  the 
density  at  the  left  end  is  20  —  19  =  1  and  at  the  right  end  is  30  —  19  =  1 1,  as  before.  Hence  the  center  of 
mass  must  be  at  approximately  20  +  6.39  =  26.39.  Let’s  see  how  the  calculation  works  out. 


Mq  = 


M  = 


Mo 

M 


4750  1 

nr  60 


475 

nr 


«  26.39. 


3 


19x2  30  _  4750 
^  20  ^ 


X 


Example  8.20:  Centroid  of  a  Flat  Plate 


Suppose  a  flat  plate  of  uniform  density  has  the  shape  contained  by  y  —  x2,y  —  1.  and  x  —  0,  in  the 
hrst  quadrant.  Find  the  center  of  mass.  (Since  the  density  is  constant,  the  center  of  mass  depends 
only  on  the  shape  of  the  plate,  not  the  density,  or  in  other  words,  this  is  a  purely  geometric  quantity. 
In  such  a  case  the  center  of  mass  is  called  the  centroid.) 


Solution.  This  is  a  two  dimensional  problem,  but  it  can  be  solved  as  if  it  were  two  one  dimensional 
problems:  we  need  to  find  the  x  and  y  coordinates  of  the  center  of  mass,  x  and  y,  and  fortunately  we  can  do 
these  independently.  Imagine  looking  at  the  plate  edge  on,  from  below  the  x-axis.  The  plate  will  appear  to 
be  a  beam,  and  the  mass  of  a  short  section  of  the  “beam”,  say  between  x,-  and  x,-+i,  is  the  mass  of  a  strip  of 
the  plate  between  x,-  and  x,-+i.  See  Figure  8.14  showing  the  plate  from  above  and  as  it  appears  edge  on. 


Figure  8.14:  Center  of  mass  for  a  two  dimensional  plate. 
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Since  the  plate  has  uniform  density  we  may  as  well  assume  that  <7  =  1.  Then 
between  x,  and  x,+ 1  is  approximately  /»,  =  <7 ( 1  —  xj)Ax  —  (1  —  x2)Ax.  Now  we  can 
around  the  y-axis: 

r  l  j 

Mv—  x(  1  —  x2)  dx  —  - 

Jo  4 

and  the  total  mass 


2 

3 


the  mass  of  the  plate 
compute  the  moment 


and  finally 

__  1  3  _  3 
X~42~  8 


Next  we  do  the  same  thing  to  find  y.  The  mass  of  the  plate  between  y,  and  yl+\  is  approximately  n, 
so 


and 


3 

5’ 


since  the  total  mass  M  is  the  same.  The  center  of  mass  is  shown  in  Figure  8.14. 


X 


Example  8.21:  Center  of  Mass  under  Cosine 


Find  the  center  of  mass  of  a  thin,  uniform  plate  whose  shape  is  the  region  between  y  —  cos  x  and  the 
x-axis  between  x  —  —n/ 2  and  x—  n/2. 


Solution.  It  is  clear  that  x 
we  compute  it  first: 


0,  but  for  practice  let’s  compute  it  anyway.  We  will  need  the  total  mass,  so 


M 


cos  xdx  —  sinx 


jt/2 

—it/ 2 


2. 


The  moment  around  the  y-axis  is 


rJt/2 

My  —  /  xcos.r  dx 

J-tc/2 


cosx  +  xsinx 


jt/2 

—n/2 


0 


and  the  moment  around  the  x-axis  is 


Mx  —  /  y  •  2  arccos ydy  —  y2  arccos y  — 
Jo 


yV i- 


yz  arcsiny 


+ 


K 

4' 


X 


0 

2’ 


K 


«  0.393. 


X 


Thus 
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Exercises  for  8.6 


Exercise  8.6.1  A  beam  10  meters  long  has  density  o  (x)  =  x 2  at  distance  xfrom  the  left  end  of  the  beam. 
Find  the  center  of  mass  x. 

Exercise  8.6.2  A  beam  10  meters  long  has  density  c(x)  —  sin(7Tx/10)  at  distance  xfrom  the  left  end  of 
the  beam.  Find  the  center  of  mass  x. 

Exercise  8.6.3  A  beam  4  meters  long  has  density  a(x)  =  x3  at  distance  xfrom  the  left  end  of  the  beam. 
Find  the  center  of  mass  x. 


Exercise  8.6.4  Verify  that 


o  ,  2  Vl-x2 

2x  arccos  xdx  —  x  arccos  x - - - 


arc sin  x 
+  — x — +C. 


Exercise  8.6.5  A  thin  plate  lies  in  the  region  between  y  —  x  and  the  x-axis  between  x  =  1  and  x  —  2.  Find 
the  centroid. 

Exercise  8.6.6  A  thin  plate  fills  the  upper  half  of  the  unit  circle  x“  +  y  =  1.  Find  the  centroid. 

u  'y 

Exercise  8.6.7  A  thin  plate  lies  in  the  region  contained  byy  —  x  and  y  —  x  .  Find  the  centroid. 

Exercise  8.6.8  A  thin  plate  lies  in  the  region  contained  by  y  —  4  —  x  and  the  x-axis.  Find  the  centroid. 

Exercise  8.6.9  A  thin  plate  lies  in  the  region  contained  by  y  —  x1/3  and  the  x-axis  between  x  =  0  and 
x  —  1.  Find  the  centroid. 

Exercise  8.6.10  A  thin  plate  lies  in  the  region  contained  by  sfxF^fy  —  1  and  the  axes  in  the  first  quadrant. 
Find  the  centroid. 

Exercise  8.6.11  A  thin  plate  lies  in  the  region  between  the  circle  x  +y~  =  4  and  the  circle  x  +  y  —  1, 
above  the  x-axis.  Find  the  centroid. 

Exercise  8.6.12  A  thin  plate  lies  in  the  region  between  the  circle  x  +y  —4  and  the  circle  x  +  v~  =  1  in 
the  first  quadrant.  Find  the  centroid. 

Exercise  8.6.13  A  thin  plate  lies  in  the  region  between  the  circle  x  +  y  —25  and  the  circle  x“  +_y  =16 
above  the  x-axis.  Find  the  centroid. 


8.7  Arc  Length 


Here  is  another  geometric  application  of  the  integral:  Find  the  length  of  a  portion  of  a  curve.  As  usual,  we 
need  to  think  about  how  we  might  approximate  the  length,  and  turn  the  approximation  into  an  integral. 
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We  already  know  how  to  compute  one  simple  arc  length,  that  of  a  line  segment.  If  the  endpoints  are 

P()  (x(j,  \'oj  and  P\  (A  |  ,vi )  then  the  length  of  the  segment  is  the  distance  between  the  points,  \J  (x\  —  xo)2  +  (yi  — yoj2, 
from  the  Pythagorean  theorem,  as  illustrated  in  Figure  8.15. 


(*t.yt) 


Figure  8.15:  The  length  of  a  line  segment. 

Now  if  the  graph  of  /  is  “nice”  (say,  differentiable)  it  appears  that  we  can  approximate  the  length  of 
a  portion  of  the  curve  with  line  segments,  and  that  as  the  number  of  segments  increases,  and  their  lengths 
decrease,  the  sum  of  the  lengths  of  the  line  segments  will  approach  the  true  arc  length;  see  Figure  8.16. 


Now  we  need  to  write  a  formula  for  the  sum  of  the  lengths  of  the  line  segments,  in  a  form  that  we  know 
becomes  an  integral  in  the  limit.  So  we  suppose  we  have  divided  the  interval  [a,  b\  into  n  subintervals  as 
usual,  each  with  length  Ax  —  (, b  —  a)/n ,  and  endpoints  a  —  xq,  x\,  X2,  . . . ,  xn  =  b.  The  length  of  a  typical 

line  segment,  joining  (*,•,/(*,•))  to  (xi+uf(xi+i)),  is  yj (Ax)2  +  {f(xl+\) -/(x,))2.  By  the  Mean  Value 

Theorem,  there  is  a  number  t,-  in  x,+i)  such  that  [' (it) Ax  =  f{xl+\  )  —  /(*,•),  so  the  length  of  the  line 
segment  can  be  written  as 

\J  (Ax)2  +  (/'(f;))2Ax2  =  ^1  +  {f(ti))2Ax. 

Then  arc  length  is: 

iim  l  \! 1 +(f(ti))2Ax  =  [  \! 1 + (f(x))2dx- 

n^°°£b  V  Ja  V 

Note  that  the  sum  looks  a  bit  different  than  others  we  have  encountered,  because  the  approximation  con¬ 
tains  a  tj  instead  of  an  x;  .  In  the  past  we  have  always  used  left  endpoints  (namely,  x,)  to  get  a  representative 
value  of  /  on  [x,-,x,-+ 1] ;  now  we  are  using  a  different  point,  but  the  principle  is  the  same. 
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To  summarize,  to  compute  the  length  of  a  curve  on  the  interval  [a,b\,  we  compute  the  integral 

Ja  \/l  +  (f(x))2dx. 

Unfortunately,  integrals  of  this  form  are  typically  difficult  or  impossible  to  compute  exactly,  because 
usually  none  of  our  methods  for  finding  antiderivatives  will  work.  In  practice  this  means  that  the  integral 
will  usually  have  to  be  approximated. 


Example  8.22:  Circumference  of  a  Circle 


Let  f(x)  —  \J r2  —  x2 ,  the  upper  half  circle  of  radius  r.  The  length  of  this  curve  is  half  the  circum¬ 
ference,  namely  nr.  Compute  this  with  the  arc  length  formula. 


Solution  The  derivative  f'  is  —  xj  y  r2  —x2  so  the  integral  is 


dx. 


Using  a  trigonometric  substitution,  we  find  the  antiderivative,  namely  arcsin(x/ r).  Notice  that  the  integral 

is  improper  at  both  endpoints,  as  the  function  \J \/{r2  —x2)  is  undefined  when  x  —  ±r.  So  we  need  to 
compute 

,°  I  l  ,£>  I  l 

lim  /  \  /  T  dx  +  lim  /  \  ~  dx. 

D^-r+JD  V  rA—XA  D^r~  Jo  V  U  ~XZ 

This  is  not  difficult,  and  has  value  n,  so  the  original  integral,  with  the  extra  r  in  front,  has  value  nr  as 
expected.  X 


Exercises  for  8.7 


Exercise  8.7.1  Find  the  arc  length  of  f{x)  =  x3/2  on  [0,2]. 

Exercise  8.7.2  Find  the  arc  length  of  f(x)  —  x2/8  —  lnx  on  [1,2]. 

Exercise  8.7.3 

Find  the  arc  length  of  f{x)  —  ( 1  /3) (x2  +  2)3,  2  on  the  inten’al  [0, a] . 

Exercise  8.7.4  Find  the  arc  length  off(x)  —  ln(sinx)  on  the  inten’al  [tt/4,  tt/3]. 

Exercise  8.7.5  Let  a  >  0.  Show  that  the  length  ofy  —  coshx  on  [0,  a]  is  equal  to  /  cosh xdx. 

Jo 


Exercise  8.7.6  Find  the  arc  length  of  f(x)  —  coshx  on  [0,ln2]. 
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Exercise  8.7.7  Set  up  the  integral  to  find  the  arc  length  of  si  n  x  on  the  interval  [0,  n] ;  do  not  evaluate  the 
integral.  If  you  have  access  to  appropriate  software,  approximate  the  value  of  the  integral. 

Exercise  8.7.8  Set  up  the  integral  to  find  the  arc  length  ofy  —  xe  x  on  the  interval  [2,3];  do  not  evaluate 
the  integral.  If  you  have  access  to  appropriate  software,  approximate  the  value  of  the  integral. 

Exercise  8.7.9  Find  the  arc  length  ofy  —  ex  on  the  interval  [0,1].  (This  can  be  done  exactly;  it  is  a  bit 
tricky  and  a  bit  long.) 


8.8  Surface  Area 


Another  geometric  question  that  arises  naturally  is:  “What  is  the  surface  area  of  a  volume?”  For  example, 
what  is  the  surface  area  of  a  sphere?  More  advanced  techniques  are  required  to  approach  this  question  in 
general,  but  we  can  compute  the  areas  of  some  volumes  generated  by  revolution. 

As  usual,  the  question  is:  How  might  we  approximate  the  surface  area?  For  a  surface  obtained  by 
rotating  a  curve  around  an  axis,  we  can  take  a  polygonal  approximation  to  the  curve,  as  in  the  last  section, 
and  rotate  it  around  the  same  axis.  This  gives  a  surface  composed  of  many  “truncated  cones”;  a  truncated 
cone  is  called  a  frustum  of  a  cone.  Figure  8.17  illustrates  this  approximation. 


Figure  8.17:  Approximating  a  surface  (left)  by  portions  of  cones  (right). 


So  we  need  to  be  able  to  compute  the  area  of  a  frustum  of  a  cone.  Since  the  frustum  can  be  formed 
by  removing  a  small  cone  from  the  top  of  a  larger  one,  we  can  compute  the  desired  area  if  we  know  the 
surface  area  of  a  cone.  Suppose  a  right  circular  cone  has  base  radius  r  and  slant  height  h.  If  we  cut  the 
cone  from  the  vertex  to  the  base  circle  and  flatten  it  out,  we  obtain  a  sector  of  a  circle  with  radius  h  and  arc 
length  2 nr,  as  in  Figure  8.18.  The  angle  at  the  center,  in  radians,  is  then  Inr/h,  and  the  area  of  the  cone 
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is  equal  to  the  area  of  the  sector  of  the  circle.  Let  A  be  the  area  of  the  sector;  since  the  area  of  the  entire 
circle  is  nh ,  we  have 

A  2nr/h 

nh2  2k 
A  =  nrh. 


Figure  8.18:  The  area  of  a  cone. 

Now  suppose  we  have  a  frustum  of  a  cone  with  slant  height  h  and  radii  ro  and  r\,  as  in  Figure  8.19. 
The  area  of  the  entire  cone  is  nr\(ho  +  h),  and  the  area  of  the  small  cone  is  nroho;  thus,  the  area  of  the 
frustum  is  nr\(ho  +  h)  —  nroho  —  n((r\  —  ro)ho  +  r\h).  By  similar  triangles, 

ho  _  ho  +  h 
r0  r\ 

With  a  bit  of  algebra  this  becomes  {r\  —  ro )  ho  —  roh:  substitution  into  the  area  gives 

r o  T-  r  | 

n{{r\  —  ro)ho  +  r\h)  —  n{r0h  +  r\h)  —  nh(r0  +  n)  =  2  n — - — h  —  2  nrh. 

The  final  form  is  particularly  easy  to  remember,  with  r  equal  to  the  average  of  ro  and  r\ ,  as  it  is  also  the 
formula  for  the  area  of  a  cylinder.  (Think  of  a  cylinder  of  radius  r  and  height  h  as  the  frustum  of  a  cone  of 
infinite  height.) 
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Figure  8.19:  The  area  of  a  frustum. 

Now  we  are  ready  to  approximate  the  area  of  a  surface  of  revolution.  On  one  subinterval,  the  situation 
is  as  shown  in  Figure  8.20.  When  the  line  joining  two  points  on  the  curve  is  rotated  around  the  x-axis,  it 
forms  a  frustum  of  a  cone.  The  area  is 

2 nrh  =  2 \Jl  +  (/'(A))2  Ax. 

Here  1  +  (f'(ti))2Ax  is  the  length  of  the  line  segment,  as  we  found  in  the  previous  section.  Assuming  / 
is  a  continuous  function,  there  must  be  some  x*  in  [x,,x,+ 1  such  that  (/(x,)  +/(x,-+i))/2  =  / (x* ) ,  so  the 
approximation  for  the  surface  area  is 

i=0 

This  is  not  quite  the  sort  of  sum  we  have  seen  before,  as  it  contains  two  different  values  in  the  interval 
[x(,x;_|_i] ,  namely  x*  and  Nevertheless,  using  more  advanced  techniques  than  we  have  available  here,  it 
turns  out  that 

lim  E  2T/(x-  )  J 1  +  (f(ti))2Ax  =  I  2nf{x)  J 1  +{f{x))1dx 

n^°°P0  V  Ja  V 

is  the  surface  area  we  seek.  (Roughly  speaking,  this  is  because  while  x*  and  /,■  are  distinct  values  in 
[xj,x/+i],  they  get  closer  and  closer  to  each  other  as  the  length  of  the  interval  shrinks.) 


Figure  8.20:  One  subinterval. 
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Example  8.23:  Surface  Area  of  a  Sphere 


Compute  the  surface  area  of  a  sphere  of  radius  r. 


Solution.  The  sphere  can  be  obtained  by  rotating  the  graph  of  f(x)  —  'Jr1  —  x2  about  the  x-axis.  The 
derivative  f  is  —xj  \J r 2  —  x2,  so  the  surface  area  is  given  by 


X 


If  the  curve  is  rotated  around  the  y  axis,  the  formula  is  nearly  identical,  because  the  length  of  the  line 
segment  we  use  to  approximate  a  portion  of  the  curve  doesn’t  change.  Instead  of  the  radius  /(x*),  we  use 
the  new  radius  x;  =  (x,  +  x/+i)/2,  and  the  surface  area  integral  becomes 

J  2nx\J  1  +  ( f'{x))2dx . 


Example  8.24:  Surface  Around  y-axis 


Compute  the  area  of  the  surface  formed  when  f  (x)  —  x2  between  0  and  2  is  rotated  around  the 
y-axis. 


Solution.  We  compute  f'{x)  =  2x,  and  then 

2k  f  x\J  1  +  4x2 dx  —  —  (173/2  —  1), 

Jo  6 

by  a  simple  substitution.  X 


Exercises  for  8.8 


Exercise  8.8.1  Compute  the  area  of  the  surface  formed  when  f(x)  —  2\/l  —  x  between  —1  and  0  is  rotated 
around  the  x-axis. 

Exercise  8.8.2  Compute  the  surface  area  of  example  8.24  by  rotating  f(x)  —  \fx  around  the  x-axis. 

Exercise  8.8.3  Compute  the  area  of  the  surface  formed  when  fix)  —  x3  between  1  and  3  is  rotated  around 
the  x-axis. 
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Exercise  8.8.4  Compute  the  area  of  the  surface  formed  when  f{x)  =2  +  cosh(.v')  between  0  and  1  is 
rotated  around  the  x-axis. 

Exercise  8.8.5  Consider  the  surface  obtained  by  rotating  the  graph  of  f(x)  =  1/x,  x  >  1,  around  the  x- 
axis.  This  surface  is  called  Gabriel’s  horn  or  Toricelli’s  trumpet.  Show  that  Gabriel’s  horn  has  infinite 
surface  area. 

Exercise  8.8.6  Consider  the  circle  (x  —  2)2  +  y2  =  1.  Sketch  the  surface  obtained  by  rotating  this  circle 
about  the  y-axis.  (The  surface  is  called  a  torus. )  What  is  the  surface  area? 

Exercise  8.8.7  Consider  the  ellipse  with  equation  x2 /A -\-y2  —  1.  If  the  ellipse  is  rotated  around  the  x-axis 
it  forms  an  ellipsoid.  Compute  the  surface  area. 

Exercise  8.8.8  Generalize  the  preceding  result:  rotate  the  ellipse  given  by  x2/a2  +  y2/b2  =  1  about  the 
x-axis  and  find  the  surface  area  of  the  resulting  ellipsoid.  You  should  consider  two  cases,  when  a>  b  and 
when  a  <  b.  Compare  to  the  area  of  a  sphere. 


9.  Sequences  and  Series 


Consider  the  following  sum: 

1111  1 
2+4  +  8  +  l6  +  "'  +  2;+"' 

The  dots  at  the  end  indicate  that  the  sum  goes  on  forever.  Does  this  make  sense?  Can  we  assign  a 
numerical  value  to  an  infinite  sum?  While  at  first  it  may  seem  difficult  or  impossible,  we  have  certainly 
done  something  similar  when  we  talked  about  one  quantity  getting  “closer  and  closer”  to  a  fixed  quantity. 
Here  we  could  ask  whether,  as  we  add  more  and  more  terms,  the  sum  gets  closer  and  closer  to  some  fixed 
value.  That  is,  look  at 

1  _  1 

2  “  2 

3  1  1 

4  “  2  +  4 

7  1  1  1 

8 ~  2  +  4  +  8 

15  1  1  1  1 

16  “  2  +  4  + 8 +I6 


and  so  on,  and  consider  whether  these  values  have  a  limit.  It  seems  likely  that  they  do,  namely  1 .  In  fact, 
as  we  will  see,  it’s  not  hard  to  show  that 

111  1  i  _  T  -  i  _  1 

2  +  4  +  8  +  l6+'"  +  27_  2'  ~1~2I 


and  then 


lim  1  - 

i — y°o  2l 


which  gets  closer  and  closer  to  1  as  i  gets  larger. 

There  is  a  context  in  which  we  already  implicitly  accept  this  notion  of  infinite  sum  without  really 
thinking  of  it  as  a  sum:  The  representation  of  a  real  number  as  an  infinite  decimal.  For  example, 


0.33333 


3  3  3  3  _  1 

To  +  Too  +  Tooo  +  ioooo  +  " '  “  3’ 


or  likewise 


1  4  1 

3.14159...  =  3H - H - H - 

10  100  1000 


+ 


+ 


10000  100000 


H - =  K. 


An  infinite  sum  is  called  a  series,  and  is  usually  written  using  the  same  sigma  notation  that  we  encountered 
in  Chapter  6.  In  this  case,  however,  we  use  °°  to  indicate  that  there  is  no  ‘last  term’.  The  series  we  first 
examined  can  be  written  as 


11  1  _  “  1 

2  4  2‘  w  2* 

_  i_  1  z. 
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A  related  notion  that  will  aid  our  investigations  is  that  of  a  sequence.  A  sequence  is  just  an  ordered 
(possibly  infinite)  list  of  numbers.  For  example, 


We  will  begin  by  learning  some  useful  facts  about  sequences. 


9.1  Sequences 


While  the  idea  of  a  sequence  of  numbers,  01,02,03,. . .  is  straightforward,  it  is  useful  to  think  of  a  sequence 
as  a  function.  We  have  dealt  with  functions  whose  domains  are  the  real  numbers,  or  a  subset  of  the  real 
numbers,  like  f(x)  —  sinx  A  sequence  can  be  regarded  as  a  function  with  domain  as  the  natural  numbers 
N  =  {1,2,3,...}  or  the  non-negative  integers,  Z-°  =  {0, 1,2, 3, . . .}.  The  range  of  the  function  is  still 
allowed  to  be  the  set  of  all  real  numbers;  we  say  that  a  sequence  is  a  function  /:  N  — >■  M.  Sequences  are 
commonly  denoted  in  several  different,  but  equally  acceptable  ways: 


a\, 02,03,. . . 

w:=i 

{/wCi 


As  with  functions  of  the  real  numbers,  we  will  most  often  encounter  sequences  that  can  be  expressed 
by  a  formula.  We  have  already  seen  the  sequence  a,-  =  f[i)  =  1  —  1/2'.  Some  other  simple  examples  are: 


/( 0 
f(n) 

f(n) 
/( 0 


i 

i  +  1 

1 

2" 

sin(«7r/6) 

(z  —  !)(?'  + 2) 
2' 


Frequently  these  formulas  will  make  sense  if  thought  of  either  as  functions  with  domain  M.  or  N,  though 
occasionally  one  will  make  sense  for  integer  values  only. 

The  main  question  of  interest  when  dealing  with  sequences  is  what  happens  to  the  terms  as  we  go 
further  and  further  down  the  list.  In  particular,  as  i  becomes  extremely  large,  does  a;  get  closer  to  one 
specific  value?  This  is  reminiscent  of  a  question  we  asked  in  Chapter  3,  when  looking  at  limits  of  func¬ 
tions.  In  fact,  the  problems  are  closely  related  and  we  define  the  limit  of  a  sequence  in  a  way  similar  to 
Definition  3.16. 


Definition  9.1:  Limit  of  a  Sequence 


Suppose  that  {cz„}“,  is  a  sequence.  We  say  that  lim  on  =  L  if  for  every  e  >  0  there  is  anN  >  0  so 

/?— >00 

that  whenever  n  >  N,  \on  —  L\  <  e.  If  lim  o„  —Lwe  say  that  the  sequence  converges  toL,  otherwise 

5.00 

it  diverges. 
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Intuitively,  ai—L  means  that  the  further  we  go  in  the  sequence,  the  closer  the  terms  get  to  L. 


Solution.  Suppose  £  >  0.  Then  let  N  =  log^11+£^.  Note  that  N  >  0.  Now  if  n>  N,  then 


n  > 


log2(l+e)  > 

1  +e  > 
e  > 
e  > 


1 

log2(l  +  e) 
1 
n 

21/” 

21/”  -  1 
21/"  - 1 


X 

Note  that,  as  in  Chapter  3,  we  generally  need  to  work  “backwards”  from  the  last  line  of  the  proof  to 
determine  how  to  choose  N.  Having  done  so,  we  write  the  actual  proof  as  we  have  done  here  to  show  that 
this  value  of  N  ‘works’. 

If  a  sequence  is  defined  by  a  formula  {f(i)}Y=\,  we  can  often  expand  the  domain  of  the  function  /  to 
the  set  of  all  (or  almost  all)  real  numbers.  For  example,  /(/)  =  j  is  defined  for  all  non-zero  real  numbers. 

When  this  happens,  we  can  sometimes  find  the  limit  of  the  sequence  {/(/)};! i  more  easily  by  finding 
the  limit  of  the  function  f(x),  x  G  M,  as  x  approaches  infinity. 


Proof.  This  follows  immediately  from  Definitions  3.16  and  9.1.  4k 

Hereafter  we  will  use  the  convention  that  x  refers  to  a  real-valued  variable  and  i  and  n  are  integer¬ 
valued. 


Solution.  Since  limx_>oo  \  —  0,  lim,^oo  \  =  0.  X 

Note  that  the  converse  of  Theorem  9.3  is  not  true. 

Let  f(n)  —  sin (nn).  This  is  the  sequence 


sin(07r) ,  sin(  1  %) ,  sin(27r) ,  sin(37r) , . . .  =  0, 0, 0, 0, . . . 
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since  sin(n7r)  =  0  when  n  is  an  integer.  Thus  lim  f(ri)  —  0.  But  lim  f(x),  when  x  is  real,  does  not  exist: 

n— l°o  x— loo 

as  x  gets  bigger  and  bigger,  the  values  sin(jc7r)  do  not  get  closer  and  closer  to  a  single  value,  but  take  on 
all  values  between  —  1  and  1  over  and  over.  In  general,  whenever  you  want  to  know  lim  f(n)  you  should 

n—¥  o° 

first  attempt  to  compute  lim  f(x),  since  if  the  latter  exists  it  is  also  equal  to  the  first  limit.  But  if  for  some 

X— loo 


reason  lim  f{x)  does  not  exist,  it  may  still  be  true  that  lim  f{n )  exists,  but  you’ll  have  to  figure  out  another 

x— l°o  n— l°o 

way  to  compute  it. 


It  is  occasionally  useful  to  think  of  the  graph  of  a  sequence.  Since  the  function  is  defined  only  for 
integer  values,  the  graph  is  just  a  sequence  of  points.  In  Figure  9.1  we  see  the  graphs  of  two  sequences 
and  the  graphs  of  the  corresponding  real  functions. 


5  1 

4- 

3  - 

2- 

1 

0  - 
0 


fin)  =  l/n 


5 


10 


Not  surprisingly,  the  properties  of  limits  of  real  functions  translate  into  properties  of  sequences  quite 
easily.  Theorem  3.8  about  limits  becomes: 


1 

Theorem  9.5:  Properties  of  Sequences 

Suppose  that  lim  an 

=  L  and  lim  bn 

=  M  and  k  k 

;  some  constant.  Then 

n — loo 

n— loo 

lim  kan 

j  —  k  lim  an  = 

=  kL 

n— 1  °o 

n^oo 

lim  (a„ 

+  bn)  =  lim 

an  +  lim  bn  —  LTM 

n— »°° 

n— loo 

lim  (an 

-  bn)  =  lim 

an  —  lim  bn  —  L  —  M 

n—t  °° 

n— i°o 

n — loo 

lim  (a„. 

bn)  =  lim  an 

•  lim  bn  =  LM 

n— >oo 

n— loo 

n— l°o 

..  (In 
lim  — 

limn — >co  cifi 

L 

—  — ,  ifM  is  not  0 

n  ^°o  jyn 

limn_).oo  bn 

M 

Likewise  the  Squeeze  Theorem  (3.37)  becomes: 
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And  a  final  useful  fact: 


This  says  simply  that  the  size  of  \an\  gets  close  to  zero  if  and  only  if  an  gets  close  to  zero. 


Example  9.8:  Convergence  of  a  Rational  Fraction 

Determine  whether  <J 

f  n  ] 
Ln+1  J 

r 

>  converges  or  diverges.  If  it  converges,  compute  the  limit. 

'  1 1=0 

Solution.  Defining  f(x)  —  we  obtain 

lim  —  lim  1 - ^—  =  1-0=1. 

X  +  1  X->co  X  +  1 

Thus  the  sequence  converges  to  1 .  & 


Example  9.9:  Convergence  of  Ratio  with  Natural  Logarithm 

Determine  whether  <j 

f  In  n ) 

l  n  J 

r 

>  converges  or  diverges.  If  it  converges,  compute  the  limit. 

’  n= 1 

Solution  We  compute 


ln.r  \/x 

lim  —  =  lim - =  0, 

x— x  x— 1 


using  L’Hopital’s  Rule.  Thus  the  sequence  converges  to  0. 


* 


Example  9.10:  Alternating  Ones 


Determine  whether  {(— 1)”}“=0  converges  or  diverges.  If  it  converges,  compute  the  limit. 


Solution.  f(x)  —  (— l)x  is  undefined  for  irrational  values  of  x  so  limx^.00(— 1)*  does  not  exist.  However, 
the  sequence  has  a  very  simple  pattern: 


and  clearly  diverges. 
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Example  9.11:  Convergence  of  Exponential 


Determine  whether  {(— 1/2)"}“=0  converges  or  diverges.  If  it  converges,  compute  the  limit. 

Solution.  We  consider  the  sequence  {|(  —  l/2)n|}7=o  =  {(l/2)"}“=0.  Then 

Um  f  =  Um  ^  =  °’ 

x— \  2J  x— 2X 

so  by  Theorem  9.7  the  sequence  converges  to  0.  X 


Example  9.12:  Using  the  Squeeze  Theorem  for  Sequences 


Determine  whether  {(sinn)  /  y/Ti}™=x  converges  or  diverges.  If  it  converges,  compute  the  limit. 


Solution.  Since  |  sinn|  <  1,  0  <  |  smn/y/n\  <  1  / \fn  and  we  can  use  Theorem  9.6  with  an  —  0  and  cn  — 
1  / \fn.  Since  lim  an  —  lim  cn  —  0,  lim  sin  nj \fn  —  0  and  the  sequence  converges  to  0.  X 

rc— >-oo  ft— »  OO  «— >■  OO 


Example  9.13:  Geometric  Sequence 


Let  r  be  a  fixed  real  number.  Determine  when  {r”}“=0  coverges. 


Solution.  A  particularly  common  and  useful  sequence  is  {rn}™=0,  for  various  values  of  r.  Some  are  quite 
easy  to  understand:  If  r  =  1  the  sequence  converges  to  1  since  every  term  is  1,  and  likewise  if  r  —  0  the 
sequence  converges  to  0.  If  r  —  —  1  this  is  the  sequence  of  Example  9.10  and  diverges.  Ifr>lorr<— 1 
the  terms  r”  get  large  without  limit,  so  the  sequence  diverges.  If  0  <  r  <  1  then  the  sequence  converges  to 
0.  If  —  1  <  r  <  0  then  \rn\  —  \r\n  and  0  <  \r\  <  1,  so  the  sequence  {|r|"},7=o  converges  to  0,  so  also 
converges  to  0.  converges.  In  summary,  {/■”}  converges  precisely  when  —  1  <  r  <  1  in  which  case 


lim  r'1 

n—¥  °° 


0  if  —  1  <  r  <  1 
1  if  r  =  1 


X 

Sequences  of  this  form,  or  the  more  general  form  {kr'l}™=0,  are  called  geometric  sequences  or  geo¬ 
metric  progressions.  They  are  encountered  in  a  large  variety  of  mathematical  and  real-world  applications. 

Sometimes  we  will  not  be  able  to  determine  the  limit  of  a  sequence,  but  we  still  would  like  to  know 
whether  it  converges.  In  some  cases  we  can  determine  this  even  without  being  able  to  compute  the  limit. 

A  sequence  is  called  increasing  or  sometimes  strictly  increasing  if  a/  <  al+\  for  all  i.  It  is  called 
non-decreasing  or  sometimes  (unfortunately)  increasing  if  a\  <  al+\  for  all  i.  Similarly  a  sequence  is 
decreasing  if  a,-  >  a i+  \  for  all  i  and  non-increasing  if  a,-  >  a,;+ 1  for  all  i.  If  a  sequence  has  any  of  these 
properties  it  is  called  monotonic. 
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A  sequence  is  bounded  above  if  there  is  some  number  N  such  that  a„  <  N  for  every  n,  and  bounded 
below  if  there  is  some  number  N  such  that  an  >  N  for  every  n.  If  a  sequence  is  bounded  above  and  bounded 
below  it  is  bounded.  If  a  sequence  {a,7}“=0  is  increasing  or  non-decreasing  it  is  bounded  below  (by  ciq), 
and  if  it  is  decreasing  or  non-increasing  it  is  bounded  above  (by  ciq).  Finally,  with  all  this  new  terminology 
we  can  state  an  important  theorem. 


Theorem  9.15:  Bounded  Monotonic  Sequence 


If  a  sequence  is  bounded  and  monotonic  then  it  converges. 


We  will  not  prove  this,  but  the  proof  appears  in  many  calculus  books.  It  is  not  hard  to  believe:  suppose 
that  a  sequence  is  increasing  and  bounded,  so  each  term  is  larger  than  the  one  before,  yet  never  larger 
than  some  fixed  value  N.  The  terms  must  then  get  closer  and  closer  to  some  value  between  uq  and  N.  It 
need  not  be  N,  since  N  may  be  a  “too-generous”  upper  bound;  the  limit  will  be  the  smallest  number  that 
is  above  all  of  the  terms  a/. 


Example  9.16: 

Determine  whether  <J 

r  2' — 1 1 

oo 

>  converges. 

'  i=i 

l  2‘  J 

Solution.  For  every  i  >  1  we  have  0  <  (2'  —  l)/2'  <  1,  so  the  sequence  is  bounded,  and  we  have  already 
observed  that  it  is  necessary.  Therefore,  the  sequence  converges.  & 

We  don’t  actually  need  to  know  that  a  sequence  is  monotonic  to  apply  this  theorem — it  is  enough  to 
know  that  the  sequence  is  “eventually”  monotonic,  that  is,  that  at  some  point  it  becomes  increasing  or 
decreasing.  For  example,  the  sequence  10,  9,  8,  15,  3,  21,  4,  3/4,  7/8,  15/16,  31/32, . . .  is  not  increasing, 
because  among  the  first  few  terms  it  is  not.  But  starting  with  the  term  3/4  it  is  increasing,  so  the  theorem 
tells  us  that  the  sequence  3/4, 7/8, 15/16,31/32, .. .  converges.  Since  convergence  depends  only  on  what 
happens  as  n  gets  large,  adding  a  few  terms  at  the  beginning  can’t  turn  a  convergent  sequence  into  a 
divergent  one. 
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Solution.  We  first  show  that  this  sequence  is  decreasing,  that  is,  that  n x!n  >  (n  +  Consider  the 

real  function  f{x)  —  xy^x  when  x  >  1.  We  can  compute  the  derivative,  f'(x)  —  x]/'x{  \  —  lnx) /x2,  and  note 
that  when  jc  >  3  this  is  negative.  Since  the  function  has  negative  slope,  r1/”  >  (n  +  when  n  >  3. 

Since  all  terms  of  the  sequence  are  positive,  the  sequence  is  decreasing  and  bounded  when  n  >  3,  and  so 
the  sequence  converges.  (As  it  happens,  we  can  compute  the  limit  in  this  case,  but  we  know  it  converges 
even  without  knowing  the  limit;  see  Exercise  9.1.1.)  X 


Solution.  If  we  look  at  the  ratio  of  successive  terms  we  see  that: 

an+ 1  (n+1)!  nn  (r  +  1)!  nn  r  +  1  /  n  \ 

an  (r  +  1)w+1r!  n\  (n+  1)M+1  /7  +  1  \/7+ly 

Therefore  an+ 1  <  an,  and  so  the  sequence  is  decreasing.  Since  all  terms  are  positive,  it  is  also  bounded, 
and  so  it  must  converge.  (Again  it  is  possible  to  compute  the  limit;  see  Exercise  9.1.2.)  X 


Exercises  for  9.1 


Exercise  9.1.1  Compute  lim  v1'1. 

X— 


'  *■  • 

Exercise  9.1.2  Use  the  squeeze  theorem  to  show  that  lim  —  =  0. 

n— lln 


Exercise  9.1.3  Determine  whether  {Vn  +  47-  y/n}°^=  0  converges  or  diverges.  If  it  converges,  compute 
the  limit. 


Exercise  9.1.4  Determine  whether 
limit. 


n2  +  l  \  m 

- - — ^  >  converges  or  diverges.  If  it  converges,  compute  the 

(n+  1)-  J  n=Q 


Exercise  9.1.5  Determine  whether 
limit. 


« +  47 

Vn2  +  3n  )  n=i 


converges  or  diverges.  If  it  converges,  compute  the 


Exercise  9.1.6  Determine  whether 


2n 


converges  or  diverges. 
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9.2  Series 


While  much  more  can  be  said  about  sequences,  we  now  turn  to  our  principal  interest,  series.  Recall  that  a 
series,  roughly  speaking,  is  the  sum  of  a  sequence:  If  {an}™=0  is  a  sequence  then  the  associated  series  is 


^  an  —  flo  +  a  i  +  +  •  •  • 

i= 0 

Associated  with  a  series  is  a  second  sequence,  called  the  sequence  of  partial  sums  {sn}“=0: 

n 

sn  =  Y*a‘- 

i= 0 

So 

so  =  ao,  si—ao  +  ai,  S2  =  aQ  +  a\  +  a2,  ... 

A  series  converges  if  the  sequence  of  partial  sums  converges,  and  otherwise  the  series  diverges. 

If  {kx'l}™=0  is  a  geometric  sequence,  then  the  associated  series  YT=o^x‘  *s  called  a  geometric  series. 


Theorem  9.19:  Geometric  Series  Convergence 


If\x\  <  1 ,  the  geometric  series  V  kxl  converges  to - ,  otherwise  the  series  diverges  (unless  k  —  0 ). 

;  1  —V 


Proof.  If  an  —  kxJ1,  ^  an  is  called  a  geometric  series.  A  typical  partial  sum  is 

n= 0 

sn  —  k~ b kx  +  kx^  +  kx ^  kx f1  —  k ( 1  x~\~ x ^  o?  •  •  •  H- xf1 ) . 

We  note  that 

Sn  ( 1  —  x)  —  1  X  H-  X ^  +  JC*  H - b  xf1 )  ( 1  —  X ) 

—  k(\  X~\~  X ^  +  •  •  •  +  ^ )  1  —  k{\  H-  X  “h  +  •  •  •  +  X^1  ^  ^ ).X 

=  k(  1  +  x  +  x2  +  x3  4 - bx"  —x  —  x2  —x3 - x"  — x"+1) 

=  k(l-x"+1) 


so 


s„(l  -x)  =  k(  1  -xn+l) 

,  l-x"+1 

s,,  =  k - 


1  —  x 
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If  |x|  <  1,  limx"  =  0  so 

n— 

l—yt+i  i 

lim  sn  —  lim  k - - - =  k - . 

n— >oo  «— >•  oo  1  —  X  1  —  X 

Thus,  when  |x|  <  1  the  geometric  series  converges  to  k/(l  —  x).  X 

When,  for  example,  k  —  1  and  x—  1/2: 

1  —  (l/2),!+1  2n+1  —  1  1  ,  “  1  1 

Sn  ~  1-1/2  ~  2n  _  ~~  2”  an  2"  ~~  1-1/2  ~ 


We  began  the  chapter  with  the  series 


n=  1 


namely,  the  geometric  series  without  the  first  term  1 .  Each  partial  sum  of  this  series  is  1  less  than  the 
corresponding  partial  sum  for  the  geometric  series,  so  of  course  the  limit  is  also  one  less  than  the  value  of 
the  geometric  series,  that  is, 


I 


n=  1 


i 

2n 


1. 


It  is  not  hard  to  see  that  the  following  theorem  follows  from  Theorem  9.5. 


Note  that  when  c  is  non-zero,  the  converse  of  the  first  part  of  this  theorem  is  also  true.  That  is,  i f  £  can 
is  convergent,  then  £ a„  is  also  convergent;  i f  £  can  converges  then  ^£ca„  must  converge. 

On  the  other  hand,  the  converse  of  the  second  part  of  the  theorem  is  not  true.  For  example,  if  an  —  1 
and  bn  —  —  1,  then  L an  +  E bn  =  E  **  =  0  converges,  but  each  of  and  diverges. 

In  general,  the  sequence  of  partial  sums  sn  is  harder  to  understand  and  analyze  than  the  sequence  of 
terms  an,  and  it  is  difficult  to  determine  whether  series  converge  and  if  so  to  what.  The  following  result 
will  let  us  deal  with  some  simple  cases  easily. 


Proof.  Since  converges,  lim  sn  —  L  and  lim  sn-\  =  L,  because  this  really  says  the  same  thing  but 

n— rc— »  oo 

“renumbers”  the  terms.  By  Theorem  9.5, 


lim  (s„  —  sn-i )  =  lim  sn  —  lim  sn _ i  —  L  —  L  —  0. 

«— n— >°°  n— » oo 
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But 

s?i  —  sn—  1  —  ( a0  +  ^1  +  Cl2  H - b  (In)  ~  (a0  +  6(1  +  0-2  3 - \~  an— l)  =  am 

so  as  desired  lim  an  =  0.  X 

n— >oo 

This  theorem  presents  an  easy  divergence  test:  Given  a  series  £a„,  if  the  limit  lim  an  does  not  exist  or 

n— 

has  a  value  other  than  zero,  the  series  diverges.  Note  well  that  the  converse  is  not  true:  If  lim  an  =  0  then 

n—¥  °° 

the  series  does  not  necessarily  converge. 


Proof.  Consider  the  statement  of  the  theorem  in  contrapositive  form: 


If  V  a„  converges,  then  lim  an  —  0. 


If  sn  are  the  partial  sums  of  the  series,  then  the  assumption  that  the  series  converges  gives  us 


lim  sn  —  s 

n— 


for  some  number  s.  Then 


lim  an  —  lim  (sn  —  sn- 1)  =  lim  sn  —  lim  sn- \  —  s  —  s  —  0. 

n— too  }i  ^oo  n— >°o  n— >oo 


X 


Solution.  We  compute  the  limit: 


lim  — n—  =  1^0. 
n-t°°  n  +  1 


Looking  at  the  first  few  terms  perhaps  makes  it  clear  that  the  series  has  no  chance  of  converging: 


1 

2 


+ 


3 

4 


+  ■■■ 


will  just  get  larger  and  larger;  indeed,  after  a  bit  longer  the  series  starts  to  look  very  much  like - b  1  + 

1  +  1  +  1  H - ,  and  of  course  if  we  add  up  enough  l’s  we  can  make  the  sum  as  large  as  we  desire.  X 
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Solution.  Here  the  theorem  does  not  apply:  lim  l/n  —  0,  so  it  looks  like  perhaps  the  series  converges. 

ft— » <X> 

Indeed,  if  you  have  the  fortitude  (or  the  software)  to  add  up  the  first  1000  terms  you  will  find  that 


so  it  might  be  reasonable  to  speculate  that  the  series  converges  to  something  in  the  neighborhood  of  10. 
But  in  fact  the  partial  sums  do  go  to  infinity;  they  just  get  big  very,  very  slowly.  Consider  the  following: 


111  111  11 

1+2+3+4>1+2+4+4“1+2+2 

1111111  1111111  111 

1+2+3+4+5+6+7+8>1+2+4+4+8+8+8+8“1+2+2+2 
11  1  1111  11  1  111 
1  +  -  +  -  +  --  +  ->l  +  -  +  i  +  i  +  -  +  -  +  -  +  -  +  -  +  --l  +  -  +  -  +  -  + 


1 

2 


and  so  on.  By  swallowing  up  more  and  more  terms  we  can  always  manage  to  add  at  least  another  1/2 
to  the  sum,  and  by  adding  enough  of  these  we  can  make  the  partial  sums  as  big  as  we  like.  In  fact,  it’s  not 
hard  to  see  from  this  pattern  that 


.11  1 

1  +  -  +  -+-+- 


>  1  + 


n 

2’ 


so  to  make  sure  the  sum  is  over  100,  for  example,  we’d  add  up  terms  until  we  get  to  around  1/2198,  that 
is,  about  4  •  1059  terms.  This  series,  £(!/«),  is  called  the  harmonic  series.  £ 


We  will  often  make  use  of  the  fact  that  the  first  few  (e.g.  any  finite  number  of)  terms  in  a  series  are 
irrelevant  when  determining  whether  it  will  converge.  In  other  words,  £“=0  an  converges  if  and  only  if 
£”=;V  an  converges  for  some  TV  >  1 . 


Exercises  for  9.2 


Exercise  9.2.1  Explain  why  ^ 

n=  1 


In2  +  1 


diverges. 


Exercise  9.2.2  Explain  why  ^ 

n=  1 


5 

2l/n  + 14 


diverges. 


Exercise  9.2.3  Explain  why  )  -  diverges. 

^  n 


12=1 


Exercise  9.2.4  Compute 


3 

3"’ 
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~  3  4 

Exercise  9.2.5  Compute  2^  2"  5« ' 


;?=0 


°°  ^n+l 

Exercise  9.2.6  Compute  ^  ■ 


n=0 


°° 

Exercise  9.2.7  Compute  ^  • 

“  / 3X  " 

Exercise  9.2.8  Compute  I 

n=l  V 
°° 

Exercise  9.2.9  Compute  ^ 

«=  1  ^ 


9.3  The  Integral  Test 


It  is  generally  quite  difficult,  often  impossible,  to  determine  the  value  of  a  series  exactly.  In  many  cases 
it  is  possible  at  least  to  determine  whether  or  not  the  series  converges,  and  so  we  will  spend  most  of  our 
time  on  this  problem. 

If  all  of  the  terms  an  in  a  series  are  non-negative,  then  clearly  the  sequence  of  partial  sums  s„  is 
non-decreasing.  This  means  that  if  we  can  show  that  the  sequence  of  partial  sums  is  bounded,  the  series 
must  converge.  Many  useful  and  interesting  series  have  this  property,  and  they  are  among  the  easiest  to 
understand.  Let’s  look  at  an  example. 


Solution.  The  terms  l/n2  are  positive  and  decreasing,  and  since  lim  1  /x2  =  0,  the  terms  1  / n1  approach 

-T— 

zero.  We  seek  an  upper  bound  for  all  the  partial  sums,  that  is,  we  want  to  find  a  number  N  so  that  sn<N 
for  every  n.  The  upper  bound  is  provided  courtesy  of  integration,  and  is  illustrated  in  figure  9.2. 
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Figure  9.2:  Graph  of  y  —  1  /x1  with  rectangles. 


The  figure  shows  the  graph  of  y  —  1  /x2  together  with  some  rectangles  that  lie  completely  below  the 
curve  and  that  all  have  base  length  one.  Because  the  heights  of  the  rectangles  are  determined  by  the  height 
of  the  curve,  the  areas  of  the  rectangles  are  l/l2,  1/22,  1/32,  and  so  on — in  other  words,  exactly  the 
terms  of  the  series.  The  partial  sum  sn  is  simply  the  sum  of  the  areas  of  the  first  n  rectangles.  Because  the 
rectangles  all  lie  between  the  curve  and  the  x-axis,  any  sum  of  rectangle  areas  is  less  than  the  corresponding 
area  under  the  curve,  and  so  of  course  any  sum  of  rectangle  areas  is  less  than  the  area  under  the  entire  curve. 
Unfortunately,  because  of  the  asymptote  at  x  —  0,  the  integral  /0°°  \  is  infinite,  but  we  can  deal  with  this 
by  separating  the  first  term  from  the  series  and  integrating  from  1 : 

n  ]  n  i  r-n  j  rcx>  j 

sn  —  ~2  —  1  +  +  1  T  /  ~  dx  <!  1  +  /  ~2  dx  =  1  T  1  =  2 

■ ^  l  • ^  l  J IX  J  \  A 

(Recalling  that  we  computed  this  improper  integral  in  section  7.7).  Since  the  sequence  of  partial  sums  sn 
is  increasing  and  bounded  above  by  2,  we  know  that  lim  sn  —  L  <  2,  and  so  the  series  converges  to  some 

n— >°° 

number  less  than  2.  In  fact,  it  is  possible,  though  difficult,  to  show  that  L  —  n2/ 6  ~  1.6.  A 

We  already  know  that  £  1/m  diverges.  What  goes  wrong  if  we  try  to  apply  this  technique  to  it?  Here’s 
the  calculation: 


111  1  r  1  r  1  , 

;  =  —  +  —  +  —  +  --  -H —  <1+  /  —  dx  <  1  +  /  —  dx  =l+o°. 

12  3  n  J  ix  J  ix 

The  problem  is  that  the  improper  integral  doesn’t  converge.  Note  that  this  does  not  prove  that  £l/n 

diverges,  just  that  this  particular  calculation  fails  to  prove  that  it  converges.  A  slight  modification,  however, 
allows  us  to  prove  in  a  second  way  that  £ \/n  diverges. 

Consider  a  slightly  altered  version  of  Figure  9.2,  shown  in  Figure  9.3. 
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Figure  9.3:  Graph  of  y  —  1/x  with  rectangles. 


This  time  the  rectangles  are  above  the  curve,  that  is,  each  rectangle  completely  contains  the  corre¬ 
sponding  area  under  the  curve.  This  means  that 

111  1  rn+l  1  n+l 

s„  —  -  +  -  +  --\ - (-->  /  -dx  —  lnx  —  ln(n  +  1). 

12  3  n  J  ix  l 

As  n  gets  bigger,  In (n  +  1)  goes  to  inhnity,  so  the  sequence  of  partial  sums  sn  must  also  go  to  infinity,  so 
the  harmonic  series  diverges. 

The  key  fact  in  this  example  is  that 

rn+ 1  J  f°°  1 

lim  /  —  dx  —  /  -dx  —  oo 

1  x  ,/ 1  X 

So  these  two  examples  taken  together  indicate  that  we  can  prove  that  a  series  converges  or  prove  that  it 
diverges  with  a  single  calculation  of  an  improper  integral.  This  is  known  as  the  integral  test,  which  we 
state  as  a  theorem. 


Theorem  9.26:  Integral  Test 


Suppose  that  /(x)  >  0  and  is  decreasing  on  the  inhnite  interval  [1,°°)  and  thatan  —  /(«).  Then  the 

oo 

series  ^  an  converges  if  and  only  if  the  improper  integral 
11—  1 


/(x)  dx  converges. 


The  two  examples  we  have  seen  are  called  p-series;  a  p-series  is  any  series  of  the  form  ^1  /np .  If 
P  <  0,  lim  1  /np  /  0,  so  the  series  diverges.  For  positive  values  of  p  we  can  determine  precisely  which 

n—¥  oo 

series  converge. 


Theorem  9.27:  p-Series  Convergence 


A  p-series  with  p  >  0  converges  if  and  only  if  p  >  1 . 


Proof.  We  use  the  integral  test;  we  have  already  done  p  =  1,  so  assume  that  p  /  1 . 


lim  - 

D — 1  p  j 


D1~p  1 

lim - . 

zr-j-oo  1  —  p  1  —  p 
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If  p  >  1  then  1  —  p  <  0  and  lim  Dl  p  — 

o 

lim  D'  -i’  —  co.  so  the  integral  diverges. 

D^OO 


0,  so  the  integral  converges.  If  0  <  p  <  1  then  1  —  p  >  0  and 

* 


Solution.  We  could  of  course  use  the  integral  test,  but  now  that  we  have  the  theorem  we  may  simply  note 
that  this  is  a  p-series  with  p  >  1.  X 


Solution  We  know  that  if  ^  l/”4  converges  then  ^  5 / n4  also  converges,  by  Theorem  9.20.  Since 


n—  1 


n=  1 


^  1//74  is  a  convergent  p-series,  ^  5/?i4  converges  also. 


X 


n=  1 


n=  1 


Solution.  This  also  follows  from  Theorem  9.20:  Since  ^ 

n=  1 

00  5 

and  so  does  ^ 


/2=  1 


is  a  p-series  with  p  =  1/2  <  1,  it  diverges, 

X 


Since  it  is  typically  difficult  to  compute  the  value  of  a  series  exactly,  a  good  approximation  is  fre¬ 
quently  required.  In  a  real  sense,  a  good  approximation  is  only  as  good  as  we  know  it  is,  that  is,  while  an 
approximation  may  in  fact  be  good,  it  is  only  valuable  in  practice  if  we  can  guarantee  its  accuracy  to  some 
degree.  This  guarantee  is  usually  easy  to  come  by  for  series  with  decreasing  positive  terms. 


N 

Solution.  Referring  to  Figure  9.2,  if  we  approximate  the  sum  by  ^  1  /n2.  the  size  of  the  error  we  make  is 

n—  1 

the  total  area  of  the  remaining  rectangles,  all  of  which  lie  under  the  curve  1/x2  from  x  =  N  to  infinity.  So 
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we  know  the  true  value  of  the  series  is  larger  than  the  approximation,  and  no  bigger  than  the  approximation 
plus  the  area  under  the  curve  from  N  to  infinity.  Roughly,  then,  we  need  to  find  N  so  that 


f  4 

JN  X 1 


dx  <  1/100. 


We  can  compute  the  integral: 


1 

N’ 


so  if  we  choose  N  —  100  the  error  will  be  less  than  0.01.  Adding  up  the  first  100  terms  gives  approxi¬ 
mately  1.634983900.  In  fact,  we  can  do  a  bit  better.  Since  we  know  that  the  correct  value  is  between  our 
approximation  and  our  approximation  plus  the  error  (not  minus),  we  can  cut  our  error  bound  in  half  by 
taking  the  value  midway  between  these  two  values.  If  we  take  N  =  50,  we  get  a  sum  of  1.6251327  with  an 
error  of  at  most  0.02,  so  the  correct  value  is  between  1.6251327  and  1.6451327,  and  therefore  the  value 
halfway  between  these,  1.6351327,  is  within  0.01  of  the  correct  value.  We  have  mentioned  that  the  true 
value  of  this  series  can  be  shown  to  be  k2/6  m  1.644934068  which  is  0.0098  more  than  our  approxima¬ 
tion,  and  so  (just  barely)  within  the  required  error.  Frequently  approximations  will  be  even  better  than  the 
“guaranteed”  accuracy,  but  not  always,  as  this  example  demonstrates.  & 


Exercises  for  9.3 


Determine  whether  each  series  converges  or  diverges. 


Exercise  9.3.1  ^ 


1 


n=  1 


%*/ 4 


Exercise  9.3.5  Y  — 


n=  1 


Exercise  9.3.2  Y  — 

^  n2  +  1 


n=  1 


Exercise  9.3.6  Y  — 

“  en 

n=  1 


Exercise  9.3.3  Y 

^  n2 


n=l 


Exercise  9.3.7  ^ 


1 


n= 2 


n  In  n 


Exercise  9.3.4  Y  - 

n2  + 1 


Exercise  9.3.8  ^ 


1 


n=  1 


„=2  »(ln,f) 


N 


N 


Exercise  9.3.9  Find  an  N  so  that  Y  — r  is  between  Y  — r  and  Y  ~ r  +  0.005. 

L-‘.  n4  n4  n4 

n=  1  n=  1  n—  1 

°°  y  1  w  1 

Exercise  9.3.10  Find  an  N  so  that  Y  —  is  between  Y  —  and  Y - b  10~4. 

w  „n  L*  0n  L-t  an 


n= 0  ' 


pll  Mrnrnm  pH 

n= 0  ^  n=0  * 


Exercise  9.3.11  Find  an  N  so  that  V  — is  between  V  — and  Y*  — +  0.005. 

n~  n  d. 


n=  1 


n—  1 


n=  1 
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Exercise  9.3.12  Find  an  N  so  that  ^ 


N 


N 


~2n(\nn) 


is  between  ^ 


and  ^ 


“2n(\nn)-  “2n(lnn 


+  0.005. 


9.4  Alternating  Series 


Next  we  consider  series  with  both  positive  and  negative  terms,  but  in  a  regular  pattern:  they  alternate,  as 

in  the  alternating  harmonic  series: 


y  (-1)”"1 

^  n 


1  -1  1 

T  +  T  +  3  + 


1  1  1 
I~2  +  3 


In  this  example  the  magnitude  of  the  terms  decrease,  that  is,  \an\  forms  a  decreasing  sequence,  although 
this  is  not  required  in  an  alternating  series.  Recall  that  for  a  series  with  positive  terms,  if  the  limit  of  the 
terms  is  not  zero,  the  series  cannot  converge;  but  even  if  the  limit  of  the  terms  is  zero,  the  series  still  may 
not  converge.  It  turns  out  that  for  alternating  series,  the  series  converges  exactly  when  the  limit  of  the 
terms  is  zero.  In  Figure  9.4,  we  illustrate  what  happens  to  the  partial  sums  of  the  alternating  harmonic 
series.  Because  the  sizes  of  the  terms  an  are  decreasing,  the  odd  partial  sums  .sq ,  s3,  s$ .  and  so  on,  form 
a  decreasing  sequence  that  is  bounded  below  by  S2,  so  this  sequence  must  converge.  Likewise,  the  even 
partial  sums  S2,  s 4,  S(„  and  so  on,  form  an  increasing  sequence  that  is  bounded  above  by  s\ ,  so  this  sequence 
also  converges.  Since  all  the  even  numbered  partial  sums  are  less  than  all  the  odd  numbered  ones,  and 
since  the  “jumps”  (that  is,  the  a,-  terms)  are  getting  smaller  and  smaller,  the  two  sequences  must  converge 
to  the  same  value,  meaning  the  entire  sequence  of  partial  sums  si,S2,s3, . . .  converges  as  well. 


a2  =  ~\ 


< - 

04 

< - - - 

a6 

f - - - 

I  52  =  \ 

4 

s6 

a5 

s5 

- > 

^3 

=  S\  = 

a3 


Figure  9.4:  The  alternating  harmonic  series. 


The  same  argument  works  for  any  alternating  sequence  with  terms  that  decrease  in  absolute  value.  The 
alternating  series  test  is  worth  calling  a  theorem. 
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Proof.  The  odd-numbered  partial  sums,  51,53,55, . . .  ,52*+i,  •  •  •>  f°rm  a  non-increasing  sequence,  because 
$2k+3  —  s2k+ 1  —®2k+2  +  ®2k+3  <  s2k+  \  ■>  since  ci2k+2  >  «2fc+3-  This  sequence  is  bounded  below  by  52,  so 
it  must  converge,  to  some  value  L.  Likewise,  the  partial  sums  52,54,56, . . . ,52*, . . .,  form  a  non-decreasing 
sequence  that  is  bounded  above  by  51,  so  this  sequence  also  converges,  to  some  value  M.  Since  lim  an  —  0 

n— 7-°° 

and  52*+ 1  =  s2k  +  a2k+  \ , 

L  =  lim  52*+ 1  =  lim  (52*  +  a2*+i )  =  lim  52*  +  lim  a2*+ 1  —  M  +  0  —  M, 

*— 7°°  *  ^00  *— T-oo  *— 7° O 

so  L  —  M\  the  two  sequences  of  partial  sums  converge  to  the  same  limit,  and  this  means  the  entire  sequence 
of  partial  sums  also  converges  to  L.  X 

Another  useful  fact  is  implicit  in  this  discussion.  Suppose  that 

co 

T=K-ir1% 

n=  1 

and  that  we  approximate  L  by  a  finite  part  of  this  sum,  say 

N 

n=  1 

Because  the  terms  are  decreasing  in  size,  we  know  that  the  true  value  of  L  must  be  between  this  approxi¬ 
mation  and  the  next  one,  that  is,  between 

N  N+ 1 

J2(-l  )n^an  and  J^(-l  )n~lan. 

n=  1  n=l 

Depending  on  whether  N  is  odd  or  even,  the  second  will  be  larger  or  smaller  than  the  first. 


Example  9.33: 


Approximate  the  sum  of  the  alternating  harmonic  series  to  within  0.05. 


Solution.  We  need  to  go  to  the  point  at  which  the  next  term  to  be  added  or  subtracted  is  1/ 10.  Adding  up 
the  first  nine  and  the  first  ten  terms  we  get  approximately  0.746  and  0.646.  These  are  1/10  apart,  so  the 
value  halfway  between  them,  0.696,  is  within  0.05  of  the  correct  value.  X 

We  have  considered  alternating  series  with  first  index  1,  and  in  which  the  first  term  is  positive,  but  a 

00 

little  thought  shows  this  is  not  crucial.  The  same  test  applies  to  any  similar  series,  such  as  ^  (— 1  )"an, 

77=0 

OO  OO 

£(-l )%.  £(-l)'+,etc. 

71=1  71=17 


Exercises  for  9.4 


Determine  whether  the  following  series  converge  or  diverge. 


358  Sequences  and  Series 


Exercise  9.4.1 


Exercise  9.4.2 


Exercise  9.4.5 


Exercise  9.4.6 


oo 


I 

n=  1 


2n  +  5 


y  (-I)”"1 
n=4 


OO 

Exercise  9.4.3  V(— l)”-1  -  - 

4-, v  '  3n  —  2 


Exercise  9.4.4  ^(—  1)”  1 

n=  1 


Inn 

n 


OO  | 

Approximate  ^  (— 1)'!_1  -j 


oo  j 

Approximate  ^  (— 1)"  1  — ^ 

72=1  ^ 


to  within  0.005. 


to  within  0.005. 


9.5  Comparison  Tests 


As  we  begin  to  compile  a  list  of  convergent  and  divergent  series,  new  ones  can  sometimes  be  analyzed  by 
comparing  them  to  ones  that  we  already  understand. 


Solution.  The  obvious  first  approach,  based  on  what  we  know,  is  the  integral  test.  Unfortunately,  we 
can’t  compute  the  required  antiderivative.  But  looking  at  the  series,  it  would  appear  that  it  must  converge, 
because  the  terms  we  are  adding  are  smaller  than  the  terms  of  a  p-series,  that  is, 

1  1 

n2\nn  <  n 2’ 

when  n  >  3.  Since  adding  up  the  terms  l/n2  doesn’t  get  “too  big”,  the  new  series  “should”  also  converge. 
Let’s  make  this  more  precise. 

OO  J  OO  | 

The  series  ^  —  converges  if  and  only  if  ^  — —  converges — all  we’ve  done  is  dropped  the 

/->  TX  111  VI  n  VI  111  /? 

n—Z  n=5 

OO  2 

initial  term.  We  know  that  \  —  converges.  Looking  at  two  typical  partial  sums: 

— ,  nl 

n= 3 


Sn 


32ln3  42ln4  52ln5  n-\nn  32  42  52 


Since  the  p-series  converges,  say  to  L,  and  since  the  terms  are  positive,  tn  <  L.  Since  the  terms  of  the  new 
series  are  positive,  the  sn  form  an  increasing  sequence  and  sn  <tn  <  L  for  all  n.  Hence  the  sequence  {.v„} 
is  bounded  and  so  converges.  X 
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Sometimes,  even  when  the  integral  test  applies,  comparison  to  a  known  series  is  easier,  so  it’s  generally 
a  good  idea  to  think  about  doing  a  comparison  before  doing  the  integral  test. 


Example  9.35: 

^  sin/7 
Does  >  — =— 
“  n2 

n= 2 

-  converge? 

Solution.  We  can’t  apply  the  integral  test  here,  because  the  terms  of  this  series  are  not  decreasing.  Just  as 
in  the  previous  example,  however, 

|  sin  1  1 

nz  ?7Z 

because  |  sin/i|  <  1.  Once  again  the  partial  sums  are  non-decreasing  and  bounded  above  by  ^  \/n2  —  L, 
so  the  new  series  converges.  X 

Like  the  integral  test,  the  comparison  test  can  be  used  to  show  both  convergence  and  divergence.  In 
the  case  of  the  integral  test,  a  single  calculation  will  confirm  whichever  is  the  case.  To  use  the  comparison 
test  we  must  first  have  a  good  idea  as  to  convergence  or  divergence  and  pick  the  sequence  for  comparison 
accordingly. 


Solution.  We  observe  that  the  —3  should  have  little  effect  compared  to  the  n  inside  the  square  root,  and 
therefore  guess  that  the  terms  are  enough  like  1  /  Vn 2  —  l/n  that  the  series  should  diverge.  We  attempt  to 
show  this  by  comparison  to  the  harmonic  series.  We  note  that 

1  1  _  1 

y/  n2  —  3  y/n2  n 

so  that 

1  1  1111 

y/2-  —  3  \/3-  —  3  y/n-  —  3  2  3  n 

where  tn  is  1  less  than  the  corresponding  partial  sum  of  the  harmonic  series  (because  we  start  at  n  —  2 
instead  of  n  =  1).  Since  lim  tn  —  °°,  lim  sn  —  °°  as  well.  X 

n—¥  °°  n— 

So  the  general  approach  is  this:  If  you  believe  that  a  new  series  is  convergent,  attempt  to  find  a  con¬ 
vergent  series  whose  terms  are  larger  than  the  terms  of  the  new  series;  if  you  believe  that  a  new  series  is 
divergent,  attempt  to  find  a  divergent  series  whose  terms  are  smaller  than  the  terms  of  the  new  series. 
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Solution.  Just  as  in  the  last  example,  we  guess  that  this  is  very  much  like  the  harmonic  series  and  so 
diverges.  Unfortunately, 

1  1 

Vn2  +  3  n 

so  we  can’t  compare  the  series  directly  to  the  harmonic  series.  A  little  thought  leads  us  to 

1  1  _  1 

V  n2  +  3  V  n2  +  3  n2  2  n 

so  if  11/ (2/7)  diverges  then  the  given  series  diverges.  But  since  £  1/ ( 2/7 )  =  (1/2)  £  1  /n.  Theorem  9.20 
implies  that  it  does  indeed  diverge.  X 

For  reference  we  summarize  the  comparison  test  in  a  theorem. 


Exercises  for  9.5 


Determine  whether  the  series  converge  or  diverge. 

1 


Exercise  9.5.1  ^ 


2/7“  +  3/7  +  5 


Exercise  9.5.6  ^ 


In  77 


n=  1 


Exercise  9.5.2  ^ 


1 


n=2  ' 


+  3/7  —  5 


00  In  yi 

Exercise  9.5.7  y  — r 


n=  1 


Exercise  9.5.3  Y  — r 

“  2n2 


1 


n—i 


3/7  —  5 


„  .  oc  .  f  3/7  +  4 

Exercise  9.5.4  )  — ~ - - 

“ i  2«2  +  3/7  +  5 

“  3/72  +  4 


“j  2 n2  +  3/7  +  5 


Exercise  9.5.8  ^ 


n= 2 


In // 


00  0/7 


Exercise  9.5.9  ^ 


“  2"  +  5,: 


00  0/7 


Exercise  9.5.10  ^ 


“  2”  +  3;i 


Exercise  9.5.5 
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9.6  Absolute  Convergence 


Roughly  speaking  there  are  two  ways  for  a  series  to  converge:  As  in  the  case  of  Y  1  /«2,  the  individual  terms 
get  small  very  quickly,  so  that  the  sum  of  all  of  them  stays  finite,  or,  as  in  the  case  of  £(— 1  )"-1/n,  the 
terms  don’t  get  small  fast  enough  (Y  l/«  diverges),  but  a  mixture  of  positive  and  negative  terms  provides 
enough  cancellation  to  keep  the  sum  finite.  You  might  guess  from  what  we’ve  seen  that  if  the  terms 
get  small  fast  enough  that  the  sum  of  their  absolute  values  converges,  then  the  series  will  still  converge 
regardless  of  which  terms  are  actually  positive  or  negative. 


Proof.  Note  that  0  <  a„  A  \an\  <  2\an\  so  by  the  comparison  test  £  (an  +  Wn\)  converges.  Now 

n=0 


ifln  A  1^7*1)  \®n\  Ai  A  |  rZ/7 1  |  <3/7 1  0.n 

n=0  n= 0  ;z=0  n= 0 

converges  by  Theorem  9.20.  X 

So  given  a  series  Y an  with  both  positive  and  negative  terms,  you  should  first  ask  whether  Y  A;  con¬ 
verges.  This  may  be  an  easier  question  to  answer,  because  we  have  tests  that  apply  specifically  to  terms 
with  non-negative  terms.  If  Y  Wn\  converges  then  you  know  that  converges  as  well.  If  Y  Wn\  diverges 
then  it  still  may  be  true  that  Yan  converges,  but  you  will  need  to  use  other  techniques  to  decide.  Intuitively 
this  results  says  that  it  is  (potentially)  easier  for  Yan  to  converge  than  for  Y  \an\  to  converge,  because  terms 
may  partially  cancel  in  the  first  series. 

If  Y  \an\  converges  we  say  that  Y A;  converges  absolutely;  to  say  that  Yan  converges  absolutely  is  to 
say  that  the  terms  of  the  series  get  small  (in  absolute  value)  quickly  enough  to  guarantee  that  the  series 
converges,  regardless  of  whether  any  of  the  terms  cancel  each  other.  If  Yan  converges  but  Y\an\  does 

not,  we  say  that  Y an  converges  conditionally.  For  example  V  (—  1 1  converges  absolutely,  while 

,  nz 

n=  1 

OO  j 

^  (  —  l)n_1  -  converges  conditionally. 

n=  1  ^ 


Solution.  In  Example  9.35  we  saw 


that  £ 


sin  77 


n= 2 


converges,  so  the  given  series  converges  absolutely. 

X 
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3n  +  4 


Solution.  Taking  the  absolute  value,  V  — ^ - diverges  by  comparison  to  V  — — ,  so  if  the  series 

“n2nz  +  3n  +  5  “i  10/7 


n= 0  ' 


n—  1 


converges  it  does  so  conditionally.  It  is  true  that  lim  (3/7+4) /(2/z2 +  3/7  +  5)  =  0,  so  to  apply  the  alternating 

n—¥  °° 


series  test  we  need  to  know  whether  the  terms  are  decreasing.  If  we  let  f(x)  =  (3x  +  4)  / (lx2  +  3x  +  5) 
then  f{x)  —  —  (6x2  +  1 6x  —  3)/ (2x2  +  3x  +  5)2,  and  it  is  not  hard  to  see  that  this  is  negative  for  x  >  1,  so 
the  series  is  decreasing  and  by  the  alternating  series  test  it  converges.  & 


Exercises  for  9.6 


Determine  whether  each  series  converges  absolutely,  converges  conditionally,  or  diverges. 


Exercise  9.6.1  ^  ( — 1 ) 


n—  1 


n=  1 


n=  1 


Exercise  9.6.3  ^  ( — 1 ) 

72=1 


n=  1 


1 


2  n2  +  3/7  +  5 


Exercise  9.6.2  ^  ( — 1 ) "  1  — y 


3/?2  +  4 


+  3/z  +  5 


71 —  1 


In// 


In  // 


Exercise  9.6.4  ^  ( — 1 ) "  — y 


Exercise  9.6.5  ^(— 1)' 

n=2 

00 

Exercise  9.6.6  ^(— 1)' 


«=o 


Exercise  9.6.7  ^(— 1)'! 


77=0 


Exercise  9.6.8  ^  ( — 1 ) 


1 

Inn 

377 

2n  +  5': 

377 

2n  +  3r' 


n—\ 


n_i  arctan/7 
n 


9.7  The  Ratio  and  Root  Tests 


r— ^  fl 

Does  the  series  ^  7^  converge?  It  is  possible,  but  a  bit  unpleasant,  to  approach  this  with  the  integral  test 
77=0 

or  the  comparison  test,  but  there  is  an  easier  way.  Consider  what  happens  as  we  move  from  one  term  to 
the  next  in  this  series: 


rr  (/7+lV 

- 1 - h  - - — 

5n  5n+1 


+  ... 
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The  denominator  goes  up  by  a  factor  of  5,  5"+1  =5-5”,  but  the  numerator  goes  up  by  much  less: 
(n  +  l)5  =  ?75  +  5/74  +  10/?3  +  10/7~  +  5/7  +  1,  which  is  much  less  than  5 n5  when  n  is  large,  because  5//4  is 
much  less  than  n5 .  So  we  might  guess  that  in  the  long  run  it  begins  to  look  as  if  each  term  is  1/5  of  the 
previous  term.  We  have  seen  series  that  behave  like  this:  The  geometric  series. 


1  _5 
5"  4’ 

So  we  might  try  comparing  the  given  series  to  some  variation  of  this  geometric  series.  This  is  possible, 
but  a  bit  messy.  We  can  in  effect  do  the  same  thing,  but  bypass  most  of  the  unpleasant  work. 

The  key  is  to  notice  that 


lim 

n—¥  oo 


an+ 1 
dn 


(/7  +  l)5  5” 

lim  — — — - - - 

n— >oo  S'1-*"1  rr 


lim 

n—>  oo 


(n+  l)5  1 

n5  5 


1 


5’ 


This  is  really  just  what  we  noticed  above,  done  a  bit  more  formally:  in  the  long  run,  each  term  is  one 
fifth  of  the  previous  term.  Now  pick  some  number  between  1/5  and  1,  say  1/2.  Because 


lim - 

n^oo  an 


1 

5’ 


then  when  n  is  big  enough,  say  n  >N  for  some  N, 


dn+ 1  1 

Cln  2 


dn 

so  an+ 1  < 


So  au- |_i  <  ayy/2,  d^+2  <  a/v+t/2  <  a^/4,  a^+ 3  <  a^+ 2/2  <  a^j 8,  and  so  on.  The  general  form  is 
dfi+k  <  d,\'/2k.  So  if  we  look  at  the  series 


aN+k  ~  aM  +  <TV+i  +  «yv+2  +  <3/V+3  H - h  <3/V+/t  H - » 

k= 0 

its  terms  are  less  than  or  equal  to  the  terms  of  the  sequence 


.  aN  ,  dfrf  «7V  ,  ,  U7V  ,  t*7V  ^ 

aN  +  ^  +  ^-  +  ir  +  ---  +  ^k+---=  L^k=laN- 


dN 


dN 


8 


2k 


k= 0 


2k 


00  00 

So  by  the  comparison  test,  ^  aN+k  converges,  and  this  means  that  ^  an  converges,  since  we’ve  just 

k=Q  n= 0 

added  the  fixed  number  gq  +  a\  H - h  . 

Under  what  circumstances  could  we  do  this?  What  was  crucial  was  that  the  limit  of  an+i/an,  say  L, 
was  less  than  1  so  that  we  could  pick  a  value  r  so  that  L  <  r  <  1.  The  fact  that  L<r(l/5<l/2in  our 
example)  means  that  we  can  compare  the  series  £  dn  to  £V\  and  the  fact  that  r  <  1  guarantees  that  £r" 
converges.  That’s  really  all  that  is  required  to  make  the  argument  work.  We  also  made  use  of  the  fact  that 
the  terms  of  the  series  were  positive;  in  general  we  simply  consider  the  absolute  values  of  the  terms  and 
we  end  up  testing  for  absolute  convergence. 
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Theorem  9.42:  The  Ratio  Test 


Suppose  that  lim  \au+  \  /an\  —  L.  If  L  <  1  the  series  £ an  converges  absolutely,  if  L>  1  the  series 

n— >°° 

diverges,  and  ifL  —  1  this  test  gives  no  information. 


Proof.  The  example  above  essentially  proves  the  first  part  of  this,  if  we  simply  replace  1/5  by  L  and  1/2 
by  r.  Suppose  that  L  >  1,  and  pick  r  so  that  1  <  r  <  L.  Then  for  n  >  N,  for  some  N, 


K+t| 


>r  and  \an+i\  >  r\an\. 


This  implies  that  \a^+k\  >  •  |ajv|>  but  since  r>  1  this  means  that  lim  \a^+k\  ^  0,  which  means  also 

>  °o 

that  lim  an  ^  0.  By  the  divergence  test,  the  series  diverges. 

n— 

To  see  that  we  get  no  information  when  L  —  1,  we  need  to  exhibit  two  series  with  L  =  1,  one  that 
converges  and  one  that  diverges.  The  series  £  1/n2  and  £  l/n  provide  a  simple  example.  X 

The  ratio  test  is  particularly  useful  for  series  involving  the  factorial  function. 


Example  9.43: 


Solution. 

5"+1  n\  5'1+1  n\  1 

lim  - —  —  =  lim  — — 7 - —  =  lim  5- - —  =  0. 

(n  +  1 ) !  5”  n->co  5”  (n  +  1)!  (n+1) 

Since  0  <  1,  the  series  converges. 

A  similar  argument  justifies  a  similar  test  that  is  occasionally  easier  to  apply. 


X 


Theorem  9.44:  The  Root  Test 

Suppose  that  lim  \a„\ 

n— too 

diverges,  and  ifL  —  1 

\1/'1  =  L.  If  L  <  1  the  series  converges  absolutely,  if  L  >  1  the  series 

this  test  gives  no  information. 

The  proof  of  the  root  test  is  actually  easier  than  that  of  the  ratio  test,  and  is  left  as  an  exercise. 


Solution.  The  ratio  test  turns  out  to  be  a  bit  difficult  on  this  series  (try  it).  Using  the  root  test: 

/  5"\ 1/,!  (5”)1/,!  5 

lim  —  =  lim - -  —  lim  -  = 

n— \nn  J  n— ( n np/n 


0. 
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Since  0  <  1,  the  series  converges.  & 

The  root  test  is  frequently  useful  when  n  appears  as  an  exponent  in  the  general  term  of  the  series. 

Exercises  for  9.7 


Exercise  9.7.1  Compute  lim  \an+i/ an  \  for  the  series  1  / rr. 

n-^r  °° 

Exercise  9.7.2  Compute  lim  \an+i/an\for  the  series  £  l/ri. 

n— 

Exercise  9.7.3  Compute  lim  |  an  | 1  for  the  series  £  1/ n2. 

n-^r  oo 

Exercise  9.7.4  Compute  lim  \a„\],/n  for  the  series  £  1  /n. 


Exercise  9.7.5  Determine  whether  the  series  converge. 


(a)  £(-l) 

n=0 


Dll 


(c>  £ 


(b)  £ 


n\ 


n=  1 


OO  5 

rr 


n=  1 


«=1 


Exercise  9.7.6  Prove  Theorem  9.44,  the  root  test. 


9.8  Power  Series 


Recall  that  the  sum  of  a  geometric  series  can  be  expressed  using  the  simple  formula: 


n= 0 


k 

\—x 


if  \x\  <  1,  and  that  the  series  diverges  when  |jc|  >  1.  At  the  time,  we  thought  of  x  as  an  unspecified  constant, 
but  we  could  just  as  well  think  of  it  as  a  variable,  in  which  case  the  series 


L  kx’1 

n= 0 

is  a  function,  namely,  the  function  k/{  1  —  x),  as  long  as  \x\  <  1:  Looking  at  this  from  the  opposite  per¬ 
spective,  this  means  that  the  function  k/(l  —  x)  can  be  represented  as  the  sum  of  an  infinite  series.  Why 
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would  this  be  useful?  While  k/(  1  —  x)  is  a  reasonably  easy  function  to  deal  with,  the  more  complicated 
representation  £ kxn  does  have  some  advantages:  it  appears  to  be  an  infinite  version  of  one  of  the  simplest 
function  types — a  polynomial.  Later  on  we  will  investigate  some  of  the  ways  we  can  take  advantage  of 
this  ‘infinite  polynomial’  representation,  but  first  we  should  ask  if  other  functions  can  even  be  represented 
this  way. 

The  geometric  series  has  a  special  feature  that  makes  it  unlike  a  typical  polynomial — the  coefficients 
of  the  powers  of  x  are  all  the  same,  namely  k.  We  will  need  to  allow  more  general  coefficients  if  we  are  to 
get  anything  other  than  the  geometric  series. 


As  we  did  in  the  section  on  sequences,  we  can  think  of  the  a„  as  being  a  function  a(n)  defined  on  the 
non-negative  integers.  Note,  however,  that  the  an  do  not  depend  on  x. 


Solution.  We  can  investigate  convergence  using  the  ratio  test: 


.in+i 


n 


lim  - — 
ii— z°o  n+  1  \x  ; 


=  lim  Ijcl 


n+  1 


=  x  . 


Thus  when  |x|  <  1  the  series  converges  and  when  |x|  >  1  it  diverges,  leaving  only  two  values  in  doubt. 
When  x  =  1  the  series  is  the  harmonic  series  and  diverges;  when  x  =  —  1  it  is  the  alternating  harmonic 
series  (actually  the  negative  of  the  usual  alternating  harmonic  series)  and  converges.  Thus,  we  may  think 


A  bit  of  thought  reveals  that  the  ratio  test  applied  to  a  power  series  will  always  have  the  same  nice 
form.  In  general,  we  will  compute 


lim  |fl„+i||x|”+1 
n-+°°  laJlxl” 


lim  |x| 

tl — Z°o 


|^/i+l  | 
|  fl?i  | 


i  r  l^n+l  |  T | 
|x|  lim  — — —  =  L  x 
w-+°°  \a\ 


assuming  that  lim|a„+i|/|a„|  exists.  Then  the  series  converges  if  L|x|  <  1,  that  is,  if  |x|  <  1/L,  and 
diverges  if  |x|  >  1/L.  Only  the  two  values  x  =  ±1/L  require  further  investigation.  Thus  the  series  will 
always  define  a  function  on  the  interval  (  —  1/L,  1/L),  that  perhaps  will  extend  to  one  or  both  endpoints  as 
well.  Two  special  cases  deserve  mention:  if  L  =  0  the  limit  is  0  no  matter  what  value  x  takes,  so  the  series 
converges  for  all  x  and  the  function  is  defined  for  all  real  numbers.  If  L  =  °°,  then  no  matter  what  value  x 
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takes  the  limit  is  infinite  and  the  series  converges  only  when  x  =  0.  The  value  1/L  is  called  the  radius  of 
convergence  of  the  series,  and  the  interval  on  which  the  series  converges  is  the  interval  of  convergence. 

We  can  make  these  ideas  a  bit  more  general.  Consider  the  series 


I 


n= 0 


(x  +  2)” 
3  n 


This  looks  a  lot  like  a  power  series,  but  with  (x  +  2)”  instead  of  xn .  Let’s  try  to  determine  the  values  of  x 
for  which  it  converges.  This  is  just  a  geometric  series,  so  it  converges  when 


|x  +  2|/3  <  1 
|x  +  2|  <3 
—3  <  x  +  2 < 3 
— 5  <  x  <  1. 


So  the  interval  of  convergence  for  this  series  is  (—5, 1).  The  center  of  this  interval  is  at  —2,  which  is  at 
distance  3  from  the  endpoints,  so  the  radius  of  convergence  is  3,  and  we  say  that  the  series  is  centered  at 
-2. 

Interestingly,  if  we  compute  the  sum  of  the  series  we  get 


co 


I 

n=0 


1 


i  _ x+2 

1  3 


3 

1  — x 


Multiplying  both  sides  by  1/3  we  obtain 


I 


/?=0 


(x  +  2)” 
3”+1 


1 

1  —  x 


which  we  recognize  as  being  equal  to 

oo 

I*", 

n=0 

so  we  have  two  series  with  the  same  sum  but  different  intervals  of  convergence. 
This  leads  to  the  following  definition: 
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Exercises  for  9.8 


Exercise  9.8.1  Find  the  radius  and  interval  of  convergence  for  each  series.  In  part  c),  do  not  attempt  to 
determine  whether  the  endpoints  are  in  the  inten’al  of  convergence. 


oo  oo 


(a)  ^nxn 

(d)  E 

n= 0 

71=1 

oo  vi 

(b) 

oo 

(e)  L 

71=0 

71=1 

OO  | 

M  E>-2>" 

71=1 

(x  +  5)” 
n{n  + 1) 


Exercise  9.8.2  Find  the  radius  of  convergence  for  the  series 


9.9  Calculus  with  Power  Series 


We  now  know  that  some  functions  can  be  expressed  as  power  series,  which  look  like  infinite  polynomials. 
Since  it  is  easy  to  find  derivatives  and  integrals  of  polynomials,  we  might  hope  that  we  can  take  derivatives 
and  integrals  of  power  series  in  an  analogous  way.  In  fact  we  can,  as  stated  in  the  following  theorem,  which 
we  will  not  prove  here. 


Example  9.50: 

Find  a  power  series  representation  of  In 

1  —  X 

9.9.  Calculus  with  Power  Series  369 


Solution.  Starting  with  the  geometric  series: 


1 


1  —  x 


I*" 


71=0 


1  °°  1 

- dx  —  —  In  1 1  —  x\  —  V  - xn+1 

l  —X  nn  +  1 

n=i) 


In  1 1  —  jc|  =  y 


71=0 


n  +  1 


-xn+] 


when  |x|  <  1.  The  series  does  not  converge  when  x  —  1  but  does  converge  when  x——  1  or  1  —  x  —  2.  The 
interval  of  convergence  is  [— 1, 1),  or  0  <  1  —  x  <  2.  We  can  use  this  series  to  express  ln(a)  as  a  series 
when  0  <  a  <  2  by  setting  x  —  1  —  a.  For  example 


ln(3/2)  =  ln(l  — (-1/2))=  £(-!)”■ 


77=0 


We  can  use  this  in  turn  to  approximate  ln(3/2): 


1  1 
TT  2n+l ' 


909 


,,oW  1  1  1  1  1  1  1 

v  '  ’  2  8  24  64  160  384  896  2240 


0.406. 


Because  this  is  an  alternating  series  with  decreasing  terms,  we  know  that  the  true  value  is  between 
909/2240  and  909/2240  -  1/2048  -  29053/71680  «  .4053,  so  0.4053  <  ln(3/2)  <  0.406.  X 

With  a  bit  of  arithmetic,  we  can  approximate  values  outside  of  the  interval  of  convergence: 


Example  9.51: 


Find  an  approximation  for  ln(9/4). 


Solution.  We  can  use  the  approximation  we  just  computed,  plus  some  rules  for  logarithms: 

ln(9/4)  -  ln((3/2)2)  =  21n(3/2)  w  0.812, 

and  using  our  bounds  above, 

0.8106  <ln(9/4)  <0.812. 


X 


Exercises  for  9.9 


Exercise  9.9.1  Find  a  series  representation  for  In  2. 

Exercise  9.9.2  Find  a  power  series  representation  for  1/(1  —x)2. 
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Exercise  9.9.3  Find  a  power  series  representation  for  2/(1  —  x)3. 

Exercise  9.9.4  Find  a  power  series  representation  for  1/(1  —  jc)3.  What  is  the  radius  of  convergence? 
Exercise  9.9.5  Find  a  power  series  representation  for  /  ln(l  —  x)  dx. 


9.10  Taylor  Series 


We  have  seen  that  some  functions  can  be  represented  as  series,  which  may  give  valuable  information 
about  the  function.  So  far,  we  have  seen  only  those  examples  that  result  from  manipulation  of  our  one 
fundamental  example,  the  geometric  series.  We  would  like  to  start  with  a  given  function  and  produce  a 
series  to  represent  it,  if  possible. 

oo 

Suppose  that  /(x)  —  ^  anxn  on  some  interval  of  convergence  centered  at  0.  Then  we  know  that  we 

n= o 

can  compute  derivatives  of  /  by  taking  derivatives  of  the  terms  of  the  series.  Let’s  look  at  the  first  few  in 
general: 

oo 

f'(x)  —  £  nanxn~ 1  —  a  i  +  2a2x  +  3a?,x2  +  4r/4X3  H - 

n=l 

oo 

f"(x)  —  £  n(n  —  1  )anxn~2  —  2a2  +  3  •  2<33X  +  4  •  3«4X2  4 - 

n— 2 

oo 

f" (x)  —  £  n(n  —  l)(n  —  2 )a„xn~3  =  3  ■  2«3  +  4  •  3  •  2apc-\ - 

n=3 


By  examining  these  it’s  not  hard  to  discern  the  general  pattern.  The  kth  derivative  must  be 

oo 

f^k\x)  —  £  nin  —  l)(n  —  2)  ■  •  ■  [n  —  k+  1  )anxn~k 

n=k 

=  k(k-l)(k-2)---(2)(l)ak+(k+l)(k)--f2)ak+lx  + 

+  (k  +  2)  (k  +  1)  •  •  •  (3)rz£-j-2^  +  •  •  • 


We  can  express  this  more  clearly  by  using  factorial  notation: 

f(k) M  =  £  (  n\\,an^~k  =  k-ak  +  (k+  \)\ak+xx  +  (^^1ak+ 2x2  +  •■■ 
„%(n-k)l  2! 

We  can  solve  for  an  by  substituting  x  —  0  in  the  formula  for  / ^  (x) : 

oo  | 

f®(0)  =  k\ak+  £  -^—an0n-k  =  k\ak, 

n=k+ 1  v1  K>  ! 
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_fk)i  0) 

Uk  k\  ‘ 

Note  that  the  original  series  for  /  yields  /( 0)  =  c/o- 

So  if  a  function  /  can  be  represented  by  a  series,  we  can  easily  find  such  a  series.  Given  a  function  /, 
the  series 

y 

n  w! 

72=0 

is  called  the  Maclaurin  series  for  /. 


Solution.  We  need  to  compute  the  derivatives  of  /  (and  hope  to  spot  a  pattern). 

/(*)  =  (1-xT1 

fix)  =  (i-*r2 

f"ix)  =  2(1 -jc)-3 
/"(*)  =  6(1  -vT4 
/W(x)  =  4!(l-x)-5 


/W(jc)  =  n!(l-jc)-"-1 


fn)i  0) 

*1(1-0)-' 

n ! 

n ! 

and  the  Maclaurin  series  is 

Ei- 

77=0 

77—0 

the  geometric  series. 

A  warning  is  in  order  here.  Given  a  function  /  we  may  be  able  to  compute  the  Maclaurin  series, 
but  that  does  not  mean  we  have  found  a  series  representation  for  /.  We  still  need  to  know  where  the 
series  converges,  and  if,  where  it  converges,  it  converges  to  fix).  While  for  most  commonly  encountered 
functions  the  Maclaurin  series  does  indeed  converge  to  /  on  some  interval,  this  is  not  true  of  all  functions, 
so  care  is  required. 

As  a  practical  matter,  if  we  are  interested  in  using  a  series  to  approximate  a  function,  we  will  need 
some  finite  number  of  terms  of  the  series.  Even  for  functions  with  messy  derivatives  we  can  compute 
these  using  computer  software  like  Sage.  If  we  want  to  describe  a  series  completely,  we  would  like  to 
be  able  to  write  down  a  formula  for  a  typical  term  in  the  series.  Fortunately,  a  few  of  the  most  important 
functions  are  very  easy. 
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Example  9.53:  Maclaurin  Series 


Find  the  Maclaurin  series  for  sinx. 


Solution.  Computing  the  first  few  derivatives  is  simple:  f'(x)  —  cosx,  f"{x)  —  —  sinx,  f"\x)  —  —  cosx, 
/f4j  (x)  =  sinx,  and  then  the  pattern  repeats.  The  values  of  the  derivative  when  x  =  0  are:  1,  0,  —1,  0,  1,  0, 
—  1,0,...,  and  so  the  Maclaurin  series  is 

r3  r5  °°  r2rc+l 

We  should  always  determine  the  radius  of  convergence: 

|x|2n+3  (2/i  +  l)!  |x| 2 

«— ►«>  (2n  + 3)!  |x|2”+1  n-^°°  (2n  +  3)(2n  +  2) 

so  the  series  converges  for  every  x.  Since  it  turns  out  that  this  series  does  indeed  converge  to  sinx  every¬ 
where,  we  have  a  series  representation  for  sinx  for  every  x.  X 

Sometimes  the  formula  for  the  nth  derivative  of  a  function  /  is  difficult  to  discover,  but  a  combination 
of  a  known  Maclaurin  series  and  some  algebraic  manipulation  leads  easily  to  the  Maclaurin  series  for  /. 


Solution.  To  get  from  sinx  to  xsin(  — x)  we  substitute  — x  for  x  and  then  multiply  by  x.  We  can  do  the  same 
thing  to  the  series  for  sinx: 


n= 0 


(-x)2,,+ 1 
(2n+l)! 


oo 


=  *£(-l)"(-l)2"+1 

71=0 


x2n~i~  * 

(2n  +  1) ! 


E(-ir+l 


77=0 


x2n+2 
(2/1  +  1)!’ 


X 

As  we  have  seen,  a  power  series  can  be  centered  at  a  point  other  than  zero,  and  the  method  that 
produces  the  Maclaurin  series  can  also  produce  such  series. 


Solution.  If  the  series  is  ^  a„(x  +  2)n  then  looking  at  the  kth  derivative: 

77=0 


k!(l  —  x) 


-k —  l 


n: 


, =k(n~kV- 


an(x  +  2) 


n—k 
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and  substituting  x  —  —2  we  get  Id 3  k  1  =  Ida £  and  a^  —  S  k  1  =  l/3fe+1,  so  the  series  is 

f,  (x  +  2)n 


n= 0 


3M+1 


Such  a  series  is  called  the  Taylor  series  for  the  function,  and  the  general  term  has  the  form 

f{n\a) 


n: 


(x  —  a)n. 


A  Maclaurin  series  is  simply  a  Taylor  series  with  a  —  0. 


Exercises  for  9.10 


Exercise  9.10.1  For  each  function,  find  the  Maclaurin  series  or  Taylor  series  centered  at  a,  and  the  radius 
of  convergence. 

(a)  cos* 

(b)  ex 

(c)  l/x,  a  —  5 

( d)  \nx,  £7—1 

(e)  \nx,  a  —  2 

(f)  l/x2,  a=  1 

(g)  l/Vl -x 

o 

(h)  Find  the  first  four  terms  of  the  Maclaurin  series  for  tanx  (up  to  and  including  the  x  term). 

(i)  Use  a  combination  of  Maclaurin  series  and  algebraic  manipulation  to  find  a  series  centered  at  zero 
for  xcos(x2). 

(j)  Use  a  combination  of  Maclaurin  series  and  algebraic  manipulation  to  find  a  series  centered  at  zero 
for  xe~x. 
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9.11  Taylor’s  Theorem 


One  of  the  most  important  uses  of  infinite  series  is  using  an  initial  portion  of  the  series  for  /  to  approximate 
/.  We  have  seen,  for  example,  that  when  we  add  up  the  first  n  terms  of  an  alternating  series  with  decreasing 
terms  that  the  difference  between  this  and  the  true  value  is  at  most  the  size  of  the  next  term.  A  similar 
result  is  true  of  many  Taylor  series. 


Proof.  The  proof  requires  some  cleverness  to  set  up,  but  then  the  details  are  quite  elementary.  We  define  a 
function  F(t)  as  follows: 

F{t)=  YT—^-{x-t)n  +  B{x-t)N+l. 

n=0  n! 

Here  we  have  replaced  a  by  t  in  the  first  N  + 1  terms  of  the  Taylor  series,  and  added  a  carefully  chosen  term 
on  the  end,  with  B  to  be  determined.  Note  that  we  are  temporarily  keeping  x  fixed,  so  the  only  variable  in 
this  equation  is  t,  and  we  will  be  interested  only  in  t  between  a  and  x.  Now  substitute  t  =  a: 

F(a)=  £  — p-(x-a)"+B(x-a)N+l. 

71=0  ^  ‘ 


Set  this  equal  to  f{x)\ 

f(x)  =  t  (x-af+B^-af+K 

c\  n\ 

77=0 

Since  we  can  solve  this  for  B,  which  is  a  “constant” — it  depends  on  x  and  a  but  those  are  temporarily 
fixed.  Now  we  have  defined  a  function  F(t)  with  the  property  that  F(a)  —  f(x).  Also,  all  terms  with 
a  positive  power  of  (x  —  t)  become  zero  when  we  substitute  x  for  t,  so  F(x)  —  f^\x)/ 0!  =  f{x).  So 
F[a )  =  F(pc).  By  Rolle’s  theorem  (5.21),  we  know  that  there  is  a  value  z  G  {a,x)  such  that  F'(z)  =  0.  But 
what  is  F'l  Each  term  in  F(t),  except  the  first  term  and  the  extra  term  involving  B.  is  a  product,  so  to  take 
the  derivative  we  use  the  product  rule  on  each  of  these  terms. 


m=f(o+^(x-o'+^P(x-o2+^(x-o3+--- 

t  j}( v  +\N+1 


+  m  (x-t)  +B(x  —  t) 


So  the  derivative  is 


F'(t)  =  /'(<)+ 1  ^^(x-t)°(-i)  +  ^r^(x->y 
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± 


± 


+ 


1! 


2! 


7(3)(0,„  +\2f  ^  ,  /(4)(0,„  *\3 


2! 


(*_0  (_1H - xt — (x_0  H - l~ 


3! 


fW(l)  n,J' 


1)  + 


AM 


(7V-1)! 

+  fl(lV+l)(jc-f)iV(-l). 

The  second  term  in  each  parenthesis  cancel  with  the  first  term  in  the  next  one,  leaving  just 


F'(t)  = 


/'"+1)w,..  „WlD,„,,v. 


/V! 


-(x-<)"  +  B(iV+l)(x-<)"(-l). 


At  some  z,  F'(z)  =  0  so 


0  = 


/(A,+1)«,„  wv 


(x-z)" +B(N+1)(x-z)n(-1) 


AM 


B{N+\)(x-z)n  =  f[N+^}Z\x~z)N 


B  = 


AM 
f(N+l\z) 

(A±  1) !  ' 


Now  we  can  write 


N 


f(<)  =  E  (*->)''+ ± ,  27/  -  ow+1 


n= 0 


n\ 


(iV+1)! 


Recalling  that  F(a)  —  f(x)  we  get 


N  y(n)^ 


/(«+«(.) 


/w =F(a)  =  L  -  o-nf+M,  - a)K+i • 


n= 0 


(iV+1)! 


which  is  what  we  wanted  to  show. 

It  may  not  be  immediately  obvious  that  this  is  particularly  useful;  let’s  look  at  some  examples. 


Example  9.57: 


Find  a  polynomial  approximation  for  sinx  accurate  to  ±0.005  for  values  ofx  in  [ — tt/2,  tt/2]  . 


Solution.  From  Taylor’s  theorem  with  a  =  0: 


sinx  = 


n= 0 


77! 


±+l)W  v+, 
(717+1)! 


What  can  we  say  about  the  size  of  the  term 


(7V+1)!  ' 


376  Sequences  and  Series 


Every  derivative  of  sinx  is  ±sinx  or  ±cosx,  so  |/^V+1^(z)|  <  1. 
So  we  need  to  pick  N  so  that 

Ev+1 

<  0.005. 


(IV+  1) ! 


Since  we  have  limited  x  to  [— n/2,n/2]. 


x"+1 


< 


2N+l 

(IV+  1) !  ' 


(1V+1)! 

The  quantity  on  the  right  decreases  with  increasing  N,  so  all  we  need  to  do  is  find  an  N  so  that 

2n+i 

<  0.005. 


(IV+1)! 

A  little  trial  and  error  shows  that  N  =  8  works,  and  in  fact  29/9!  <  0.0015,  so 


Jl  f(n)(0) 

sinx=  - t — ~  xn  it  0.0015 


n= 0 


n\ 

3  5 

X  X 


X  6  +  120  5040 


±0.0015. 


Figure  9.5  shows  the  graphs  of  sinx  and  and  the  approximation  on  [0, 3 tt/2]  .  As  x  gets  larger,  the  approx¬ 
imation  heads  to  negative  infinity  very  quickly,  since  it  is  essentially  acting  like  — x7.  X 


Figure  9.5:  sinx  and  a  polynomial  approximation. 


Note  that  we  can  now  approximate  the  value  of  sin(x)  to  within  0.005  by  using  simple  trigonometric 
identities  to  translatex  into  the  interval  [— n/2, n/2}. 

We  can  extract  a  bit  more  information  from  this  example.  If  we  do  not  limit  the  value  of  x,  we  still 
have 

(IV+1)! 


x/v+1 

(N+l)! 


77=0 


ft! 


x*+1 

(IV+1)! 


so  that  sinx  is  represented  by 
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If  we  can  show  that 


for  each  x  then 


lim 

N—>° ° 


X*+1 

(TV  H-  1 ) ! 


0 


sinx  = 


f  f(n)( Q)x„ 

n ! 


K-i) 


n= 0 


x2n+t 
(2n+  1)!’ 


that  is,  the  sine  function  is  actually  equal  to  its  Maclaurin  series  for  all  x.  How  can  we  prove  that  the 
limit  is  zero?  Suppose  that  N  is  larger  than  |x|,  and  let  M  be  the  largest  integer  less  than  |x|  (if  M  —  0  the 
following  is  even  easier).  Then 


|xA,+1|  |x|  |x|  |x|  |x|  |x|  |x| 

(TV  -h  1 ) !  “  N+\~N  N-\"  M+llti  M-\" 

<  M  !  !  !  M  M  MM 

“  N+l  MM- 1  2  1 

|x|  \x\M 
~  N+l  ~M\ 

The  quantity  \x\M /Ml  is  a  constant,  so 


|x|  \x\M 

lim - =  0 

A^<x,/V+  1  Ml 


|x|  |x| 

TT 


and  by  the  Squeeze  Theorem  (9.6) 


lim 

N — ^°° 


x^+i 


-0 


(TV+  1)! 

as  desired.  Essentially  the  same  argument  works  for  cosx  and  ex\  unfortunately,  it  is  more  difficult  to  show 
that  most  functions  are  equal  to  their  Maclaurin  series. 


Example  9.58: 


Find  a  polynomial  approximation  for  ex  near  x  —  2  accurate  to  ±0.005. 


Solution  From  Taylor’s  theorem: 


N  2 

r=  7  — 
,  nl 


N+l 


since  (x)  =  ex  for  all  n.  We  are  interested  in  x  near  2,  and  we  need  to  keep  |  (x  —  2)w+1 1  in  check,  so 
we  may  as  well  specify  that  |x  —  2|  <  1,  so  x  6  [1,3].  Also 


(TV  -1-1)! 


< 


(TV+  1) !  ’ 


so  we  need  to  hnd  an  N  that  makes  e3/(TV±  1) !  <  0.005.  This  time  N  —  5  makes  c3/(TV±  1)!  <  0.0015, 
so  the  approximating  polynomial  is 


2  „2  „2  „2 

4 


e'  =  e2+e2{x-2)  +  j{.x-2)2  +  2{x-2)3  +  f-{.x-2)i  +  2-{x-2)5±0.0015. 
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Note  that  our  approximation  requires  that  we  already  have  a  very  accurate  approximation  of  the  value 
e 2,  which  we  shouldn't  assume  we  have  in  the  context  of  trying  to  approximate  ex .  For  this  reason  we 
typically  try  to  center  our  series  on  values  for  which  the  derivative  of  the  function  is  easy  to  evaluate  (e.g. 
a  —  0).  4k 

Note  well  that  in  these  examples  we  found  polynomials  of  a  certain  accuracy  only  on  a  small  interval, 
even  though  the  series  for  sinx  and  ex  converge  for  all  x;  this  is  typical.  To  get  the  same  accuracy  on  a 
larger  interval  would  require  more  terms. 


Exercises  for  9.11 


Exercise  9.11.1  Find  a  polynomial  approximation  for  cosx  on  [0,  n\,  accurate  to  ±10  3 

Exercise  9.11.2  How  many  terms  of  the  series  for  lnx  centered  at  1  are  required  so  that  the  guaranteed 
error  on  [1/2, 3/2]  is  at  most  10  3  ?  What  if  the  interval  is  instead  [1,3/2]? 

Exercise  9.11.3  Find  the  first  three  nonzero  terms  in  the  Taylor  series  for  tanx  on  [— tt/4,  7t/4],  and 
compute  the  guaranteed  error  term  as  given  by  Taylor’s  theorem.  (You  may  want  to  use  Sage  or  a  similar 
aid.) 

Exercise  9.11.4  Show  that  cosx  is  equal  to  its  Taylor  series  for  all  x  by  showing  that  the  limit  of  the  error 
term  is  zero  as  N  approaches  infinity. 

Exercise  9.11.5  Show  that  ex  is  equal  to  its  Taylor  series  for  all  x  by  showing  that  the  limit  of  the  error 
term  is  zero  as  N  approaches  infinity. 


10.  Differential  Equations 


Many  physical  phenomena  can  be  modeled  using  the  language  of  calculus.  For  example,  observational 
evidence  suggests  that  the  temperature  of  a  cup  of  tea  (or  some  other  liquid)  in  a  room  of  constant  tem¬ 
perature  will  cool  over  time  at  a  rate  proportional  to  the  difference  between  the  room  temperature  and  the 
temperature  of  the  tea. 

In  symbols,  if  t  is  the  time,  M  is  the  room  temperature,  and  /'(/)  is  the  temperature  of  the  tea  at  time  t 
then/'(t)  —  where/:  >  0  is  a  constant  which  will  depend  on  the  kind  of  tea  (or  more  generally 

the  kind  of  liquid)  but  not  on  the  room  temperature  or  the  temperature  of  the  tea.  This  is  Newton’s  law 
of  cooling  and  the  equation  that  we  just  wrote  down  is  an  example  of  a  differential  equation.  Ideally 
we  would  like  to  solve  this  equation,  namely,  find  the  function  f(t  )  that  describes  the  temperature  over 
time,  though  this  often  turns  out  to  be  impossible,  in  which  case  various  approximation  techniques  must 
be  used.  The  use  and  solution  of  differential  equations  is  an  important  field  of  mathematics;  here  we  see 
how  to  solve  some  simple  but  useful  types  of  differential  equation. 

Informally,  a  differential  equation  is  an  equation  in  which  one  or  more  of  the  derivatives  of  some  func¬ 
tion  appears.  Typically,  a  scientific  theory  will  produce  a  differential  equation  (or  a  system  of  differential 
equations)  that  describes  or  governs  some  physical  process,  but  the  theory  will  not  produce  the  desired 
function  or  functions  directly. 


10.1  First  Order  Differential  Equations 


We  start  by  considering  equations  in  which  only  the  first  derivative  of  the  function  appears. 


Definition  10.1:  First  Order  Differential  Equation 


A  first  order  differential  equation  is  an  equation  of  the  form  F{t,y,y')  —  0.  A  solution  of  a  first 
order  differential  equation  is  a  function  f(t )  that  makes  F  (l  ,f  (l) (t))  =  0  for  every  value  of  t. 


Here,  F  is  a  function  of  three  variables  which  we  label  t,  y,  and  /.  It  is  understood  that  /  will  explicitly 
appear  in  the  equation  although  t  and  y  need  not.  The  term  “first  order”  means  that  the  first  derivative  of  y 
appears,  but  no  higher  order  derivatives  do. 


Example  10.2:  Newton’s  Law  of  Cooling 


The  equation  from  Newton’s  law  of  cooling,  y'  —  k{M  —  y )  is  a  first  order  differential  equation; 
F(t,y,y')  =  k{M —y)  —  y. 
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Example  10.3:  A  First  Order  Differential  Equation 


y'  —  t2  +  1  is  a  first  order  differential  equation;  F(l,y,y  )  —  y'  —  t2  —  1 .  All  solutions  to  this  equation 
are  of  the  form  t3/3  +  t  +  C. 


Definition  10.4:  First  Order  Initial  Value  Problem 


A  first  order  initial  value  problem  is  a  system  of  equations  of  the  form  F(t,y,y ')  —  0,  y(?o)  =  Yq. 
Here  to  is  a  fixed  time  and  yo  is  a  number.  A  solution  of  an  initial  value  problem  is  a  solution  f{t) 
of  the  differential  equation  that  also  satisfies  the  initial  condition  /(to)  =  Vo- 


Example  10.5:  An  Initial  Value  Problem 


Verify  that  the  initial  value  problem  y  —  l2  +  1 ,  v(  1 )  —4  has  solution  f(t )  =  t3 /3  +  t  +  8/3. 


Solution.  Observe  that  f{t)  —  t2  +  1  and  /(l)  =  l3/2  +  1  +  8/3  =  4  as  required.  X 

The  general  first  order  equation  is  too  general,  so  we  can’t  describe  methods  that  will  work  on  them 
all,  or  even  a  large  portion  of  them.  We  can  make  progress  with  specific  kinds  of  first  order  differential 
equations.  For  example,  much  can  be  said  about  equations  of  the  form  y'  =  < p(t,y )  where  0  is  a  function 
of  the  two  variables  t  and  y.  Under  reasonable  conditions  on  (j),  such  an  equation  has  a  solution  and  the 
corresponding  initial  value  problem  has  a  unique  solution.  However,  in  general,  these  equations  can  be 
very  difficult  or  impossible  to  solve  explicitly. 


Example  10.6:  IVP  for  Newton’s  Law  of  Cooling 


Consider  this  specific  example  of  an  initial  value  problem  for  Newton ’s  law  of  cooling:  y'  —  2(25  — 
y),  y(0)  =  40.  Discuss  the  solutions  for  this  initial  value  problem. 


Solution.  We  first  note  the  zero  of  the  equation:  If  y(to)  —  25,  the  right  hand  side  of  the  differential 
equation  is  zero,  and  so  the  constant  function  y(t)  —  25  is  a  solution  to  the  differential  equation.  It  is 
not  a  solution  to  the  initial  value  problem,  since  y(0)  /  25.  (The  physical  interpretation  of  this  constant 
solution  is  that  if  a  liquid  is  at  the  same  temperature  as  its  surroundings,  then  the  liquid  will  stay  at  that 
temperature.)  So  long  as  y  is  not  25,  we  can  rewrite  the  differential  equation  as 


dt  25  —  y 

— - dy  =  2  dt, 

25  —  y 


so 


2  dt. 


that  is,  the  two  anti-derivatives  must  be  the  same  except  for  a  constant  difference.  We  can  calculate  these 
anti-derivatives  and  rearrange  the  results: 


2  dt 
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(  1 )  In  1 25  —  y| 
In  j  25  —y\ 
\25-y\ 
y- 25 
J 


2 1  +  Co 

— 2 1  —  Co  —  — 2 1  -\-  C 
e~2t+c  —  e~2tec 
±ece~2t 

25  ±  ece~2t  —  25  +Ae~2t. 


Here  A  —  ±  ec  —  ±  e  c°  is  some  non-zero  constant.  Since  we  want  y( 0)  =  40,  we  substitute  and  solve  for 
A: 


40  =  25  +Ae° 

15  =  A, 

and  so  y  =  25  +  15e  is  a  solution  to  the  initial  value  problem.  Note  that  y  is  never  25,  so  this  makes  sense 
for  all  values  of  t.  However,  if  we  allow  A  =  0  we  get  the  solution  y  =  25  to  the  differential  equation,  which 
would  be  the  solution  to  the  initial  value  problem  if  we  were  to  require  y(0)  =  25.  Thus,  y  —  25  +Ae~2r 
describes  all  solutions  to  the  differential  equation  y  =  2(25  —y),  and  all  solutions  to  the  associated  initial 
value  problems.  X 

Why  could  we  solve  this  problem?  Our  solution  depended  on  rewriting  the  equation  so  that  all  in¬ 
stances  of  y  were  on  one  side  of  the  equation  and  all  instances  of  t  were  on  the  other.  Of  course,  in  this 
case  the  only  t  was  originally  hidden,  since  we  didn’t  write  dy/dt  in  the  original  equation.  This  is  not 
required,  however. 


Example  10.7:  Solving  an  IVP 


Solve  the  differential  equation  /  =  2/  (25  —  y) . 


Solution.  This  is  almost  identical  to  the  previous  example.  As  before,  y(t)  —  25  is  a  solution.  If  y  ^  25, 


(  1 )  In  1 25  — y| 
In  |25  —y\ 


25 -y\ 
y  — 25 
J 


j  2  tdt 
t 2  +  Co 

-t2  -  c0  =  -r  +  C 

e~t2+c  =  e~t2ec 
±ece~t 2 

25  ±ece~t2  —  25  +Ae~t2 . 


As  before,  all  solutions  are  represented  by  y 


25  +Ae~t2 


,  allowing  A  to  be  zero. 


X 
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As  in  the  examples,  we  can  attempt  to  solve  a  separable  equation  by  converting  to  the  form 


This  technique  is  called  separation  of  variables.  The  simplest  (in  principle)  sort  of  separable  equation  is 
one  in  which  g(y)  —  1,  in  which  case  we  attempt  to  solve 

/  ldy  =  J  f{t)dt. 

We  can  do  this  if  we  can  find  an  anti-derivative  of  f(t) . 

As  we  have  seen  so  far,  a  differential  equation  typically  has  an  infinite  number  of  solutions.  Such 
a  solution  is  called  a  general  solution.  A  corresponding  initial  value  problem  will  give  rise  to  just  one 
solution.  Such  a  solution  in  which  there  are  no  unknown  constants  remaining  is  called  a  particular 
solution. 

The  general  approach  to  separable  equations  is  as  follows:  Suppose  we  wish  to  solve  /  =  f(t)g(y) 
where  /  and  g  are  continuous  functions.  If  g(a)  =  0  for  some  a  then  y(t)  —  a  is  a  constant  solution  of  the 
equation,  since  in  this  case  /  =  0  =  f(t)g(a).  For  example,  /  —y2  —  1  has  constant  solutions  y(t)  =  1  and 
y(t)  =  -l. 

To  find  the  nonconstant  solutions,  we  note  that  the  function  1  / g(y)  is  continuous  where  g  ^  0,  so  1  / g 
has  an  antiderivative  G.  Let  F  be  an  antiderivative  of  /.  Now  we  write 

G(y)  =  J  dy=  J  f(t)dt  =  F(t)+C, 

so  G(y)  —  F(t)+  C.  Now  we  solve  this  equation  for  y. 

Of  course,  there  are  a  few  places  this  ideal  description  could  go  wrong:  We  need  to  be  able  to  find  the 
antiderivatives  G  and  F,  and  we  need  to  solve  the  final  equation  for  y.  The  upshot  is  that  the  solutions  to 
the  original  differential  equation  are  the  constant  solutions,  if  any,  and  all  functions  y  that  satisfy  G(y)  = 
F(t)+C. 


Example  10.9:  Population  Growth  and  Radioactive  Decay 


Analyze  the  differential  equation  y'  —  ky. 


Solution.  When  k  >  0,  this  describes  certain  simple  cases  of  population  growth:  It  says  that  the  change 
in  the  population  y  is  proportional  to  the  population.  The  underlying  assumption  is  that  each  organism  in 
the  current  population  reproduces  at  a  fixed  rate,  so  the  larger  the  population  the  more  new  organisms  are 
produced.  While  this  is  too  simple  to  model  most  real  populations,  it  is  useful  in  some  cases  over  a  limited 
time.  When  k  <  0,  the  differential  equation  describes  a  quantity  that  decreases  in  proportion  to  the  current 
value;  this  can  be  used  to  model  radioactive  decay. 

The  constant  solution  is  y(t)  =  0;  of  course  this  will  not  be  the  solution  to  any  interesting  initial  value 
problem.  For  the  non-constant  solutions,  we  proceed  much  as  before: 


kdt 
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In  |_y|  =  kt  +  C 
|y|  =  ektec 

y  =  ±ecekt 
y  =  Aekt . 

Again,  if  we  allow  A  —  0  this  includes  the  constant  solution,  and  we  can  simply  say  that  y  —  Aekt  is  the 
general  solution.  With  an  initial  value  we  can  easily  solve  for  A  to  get  the  solution  of  the  initial  value 
problem.  In  particular,  if  the  initial  value  is  given  for  time  t  —  0,  y(0)  —  yo,  then  A —yo  and  the  solution  is 

y  =  yoekt.  X 


Exercises  for  10.1 


Exercise  10.1.1  Which  of  the  following  equations  are  separable? 

(a)  y'  =  sin(ty) 

(b)  y  —  el  ey 

(c)  yy'  =  t 

( d)  y  —  ( t 3  —  t)  arcsin(y) 

(e)  y'  =  t2  lny  +  At3  lny 


Exercise  10.1.2  Solve  y  =  1/(1+ 1 2). 

Exercise  10.1.3  Solve  the  initial  value  problem  y'  —  tn  with  y(0)  =  1  and  n  >  0. 

Exercise  10.1.4  Solve  y'  —  Inf. 

Exercise  10.1.5  Identify  the  constant  solutions  (if  any)  ofy'  —  t  si  n  v. 

Exercise  10.1.6  Identify  the  constant  solutions  (if  any)  ofy  —  tey . 

Exercise  10.1.7  Solve  y'  =  t/y. 

Exercise  10.1.8  Solve  y  —  y2  —  1. 

Exercise  10.1.9  Solve  y'  —  t  /  (y3  —  5).  You  may  leave  your  solution  in  implicit  form:  that  is,  you  may  stop 
once  you  have  done  the  integration,  without  solving  for  y. 


Exercise  10.1.10  Find  a  non-constant  solution  of  the  initial  value  problem  y'  —  yl/3,  y(0)  =  0,  using 
separation  of  variables.  Note  that  the  constant  function  y(t)  =  0  also  solves  the  initial  value  problem. 
This  shows  that  an  initial  value  problem  can  have  more  than  one  solution. 
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Exercise  10.1.11  Solve  the  equation  for  Newton’s  law  of  cooling  leaving  M  and  k  unknown. 

Exercise  10.1.12  After  10  minutes  in  Jean-Luc’s  room,  his  tea  has  cooled  to  40°  Celsius  from  100° 
Celsius.  The  room  temperature  is  25°  Celsius.  How  much  longer  will  it  take  to  cool  to  35°  ? 

Exercise  10.1.13  Solve  the  logistic  equation  /  =  ky(M  —  y).  (This  is  a  somewhat  more  reasonable 
population  model  in  most  cases  than  the  simpler  /  =  ky.)  Sketch  the  graph  of  the  solution  to  this  equation 
when  M  —  1000,  k  =  0.002,  y(0)  =  1. 

Exercise  10.1.14  Suppose  thaty '  —  ky,  v(0)  =  2,  andy'{ 0)  =  3.  What  isy? 

Exercise  10.1.15  A  radioactive  substance  obeys  the  equation  y'  =  ky  where  k  <  0  and  y  is  the  mass  of  the 
substance  at  time  l.  Suppose  that  initially,  the  mass  of  the  substance  is  v(0j  —  M  >  0.  At  what  time  does 
half  of  the  mass  remain?  (This  is  known  as  the  half  life.  Note  that  the  half  life  depends  on  k  but  not  on 
M.) 

Exercise  10.1.16  Bismuth-210  has  a  half  life  of  five  days.  If  there  is  initially  600  milligrams,  how  much 
is  left  after  6  days?  When  will  there  be  only  2  milligrams  left? 

Exercise  10.1.17  The  half  life  of  carbon-14  is  5730  years.  If  one  starts  with  100  milligrams  of  carbon-14, 
how  much  is  left  after  6000  years?  How  long  do  we  have  to  wait  before  there  is  less  than  2  milligrams? 

Exercise  10.1.18  A  certain  species  of  bacteria  doubles  its  population  (or  its  mass)  every  hour  in  the  lab. 
The  differential  equation  that  models  this  phenomenon  is  y'  —  ky,  where  k  >  0  and  y  is  the  population  of 
bacteria  at  time  t.  What  is  y  ? 

Exercise  10.1.19  If  a  certain  microbe  doubles  its  population  every  4  hours  and  after  5  hours  the  total 
population  has  mass  500  grams,  what  was  the  initial  mass? 


10.2  First  Order  Homogeneous  Linear  Equations 


A  simple,  but  important  and  useful,  type  of  separable  equation  is  the  first  order  homogeneous  linear 
equation: 


Definition  10.10:  First  Order  Homogeneous  Linear  Equation 


A  first  order  homogeneous  linear  differential  equation  is  one  of  the  form  y'  +  p(t)y  —  0  or  equiva¬ 
lently  y'  =  —p(t)y. 


“Homogeneous”  refers  to  the  zero  on  the  right  side  of  the  equation,  provided  that  y'  and  y  are  on  the 
left.  “Linear”  in  this  definition  indicates  that  both  /  and  y  appear  independently  and  explicitly;  we  don’t 
see  y'  or  y  to  any  power  greater  than  1,  or  multiplied  by  each  other  (i.e.  y'y ). 
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Example  10.11:  Linear  Examples 


The  equation  y  —  It  (25  —y)  can  be  written  y  +2ty  —  50 1.  This  is  linear,  but  not  homogeneous.  The 
equation  y  —  ky,  ory'  —  ky  —  0  is  linear  and  homogeneous,  with  a  particularly  simple  p(t )  =  —k. 
The  equation  /  +  y2  —  0  is  homogeneous,  but  not  linear. 


Since  first  order  homogeneous  linear  equations  are  separable,  we  can  solve  them  in  the  usual  way: 


/  =  -p(<)y 


In  |y|  =  P(t)  +  C 


y  =  ±/"> 
y  =  A/W, 

where  P{t)  is  an  anti-derivative  of  —p(t).  As  in  previous  examples,  if  we  allow  A  —  0  we  get  the  constant 
solution  y  —  0. 


Solution  We  start  with 

P(t)  —  costdt  —  —  sint, 
so  the  general  solution  to  the  differential  equation  is 

y  =  Ae~sint . 


To  compute  A  we  substitute: 


so  the  solutions  is 


For  the  second  problem, 


1 

2 


=  Ag-sinO 


A, 


y  = 


sin/ 


1  =  Ae~si 

2 

A  =  -esin‘ 
2 


so  the  solution  is 


1 
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Example  10.13: 


Solve  the  initial  value  problem  ty'  +  3y  =  0,  y(  1 )  =  2,  assuming  t  >  0. 


Solution.  We  write  the  equation  in  standard  form:  /  +  3 y/t  —  0.  Then 


and 

Substituting  to  find  A:  2=A(1)~3 


P(t)  = 


j  —  jdt  —  — 31n  t 


y  —  Ae  3lnt—At  3. 


_ o 

=  A,  so  the  solution  is  y  =  2t 


X 


Exercises  for  10.2 


Find  the  general  solution  of  each  equation  in  the  following  exercises. 

Exercise  10.2.1  y  +5y  —  0 


Exercise  10.2.3  y  H - =  0 

1  +  t2 


Exercise  10.2.4  y  +  t2y  —  0 


Exercise  10.2.2  y  —  2y  =  0 

In  the  following  exercises,  solve  the  initial  value  problem. 

Exercise  10.2.5  /  +  y  —  0,  y( 0)  =  4  Exercise  10.2.10  y  +  ycos(er)  =  0,  y(0)  =  0 

Exercise  10.2.6  /  -  3y  =  0,  y(l)  =  -2  Exercise  10.2.11  ty'  -  2y  —  0,  y(l)  =  4 

Exercise  10.2.7  y'  +ys\nt  =  0,  y(ji)  =  1  Exercise  10.2.12  t2y'  +  y  =  0,  y(l)  =  -2,  t  >  0 

Exercise  10.2.8  y'  Aye'  =  0,  y(0)  =  c  Exercise  10.2.13  t3y'  =  2 y,  y(l)  =  1,  t  >  0 

Exercise  10.2.9  y'  +y\f  1  +t4  —  0,  y(0)  =  0  Exercise  10.2.14  t3y  =  2y,  y(l)  =  0,  t  >  0 


Exercise  10.2.15  A  function  y(t)  is  a  solution  ofy'  +  ky  =  0.  Suppose  thaty(  0)  =  100  and  _y(2)  =  4.  Find 
k  and  find  y(t). 

Exercise  10.2.16  A  function  y(t  )  is  a  solution  of  y  +  ry  —  0.  Suppose  that  y(0)  —  1  and  y(l)  =  e”13. 
Find  k  and  find  y(t ). 

Exercise  10.2.17  A  bacterial  culture  grows  at  a  rate  proportional  to  its  population.  If  the  population  is 
one  million  att  —  0  and  1.5  million  at  t  —  1  hour,  find  the  population  as  a  function  of  time. 

Exercise  10.2.18  A  radioactive  element  decays  with  a  half-life  of  6  years.  If  a  mass  of  the  element  weighs 
ten  pounds  at  t  —  0,  find  the  amount  of  the  element  at  time  t. 
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10.3  First  Order  Linear  Equations 


As  you  might  guess,  a  first  order  linear  differential  equation  has  the  form  y  +  p(t)y  —  /(f) .  Not  only  is  this 
closely  related  in  form  to  the  first  order  homogeneous  linear  equation,  we  can  use  what  we  know  about 
solving  homogeneous  equations  to  solve  the  general  linear  equation. 

Suppose  that  yi  (f)  and  y2(t)  are  solutions  to  y'  +  p{t)y  —  f(t).  Let  g(t)  —  y i  —  y2.  Then 

g'{t)+p{t)g{t)  =  y[-y2+p(t)(yi-y2) 

=  (/i  +p{t)y  i  )  -  (/2 + p{t)yi) 

=  m-m= o. 

In  other  words,  g(t)  —  yi  —  y2  is  a  solution  to  the  homogeneous  equation  y'  +  p(t)y  =  0.  Turning  this 
around,  any  solution  to  the  linear  equation  y  +  p{t)y  =  /(f),  call  it  y i,  can  be  written  as  y2  +  g(t),  for 
some  particular  y2  and  some  solution  g(t)  of  the  homogeneous  equation  y'  +  p(t)y  —  0.  Since  we  already 
know  how  to  find  all  solutions  of  the  homogeneous  equation,  finding  just  one  solution  to  the  equation 
y  +  p{t)y  =  /(f)  will  give  us  all  of  them. 

How  might  we  find  that  one  particular  solution  to  y  +  p(t)y  —  /(f)?  Again,  it  turns  out  that  what  we 
already  know  helps.  We  know  that  the  general  solution  to  the  homogeneous  equation  y  +  p(t)y  —  0  looks 
like  Aep<yt\  We  now  make  an  inspired  guess:  Consider  the  function  v(t)ep^\  in  which  we  have  replaced 
the  constant  parameter  A  with  the  function  v(t).  This  technique  is  called  variation  of  parameters.  For 
convenience  write  this  as  s(t )  =  v(t)h(t),  where  h(t)  —  ep ^  is  a  solution  to  the  homogeneous  equation. 
Now  let’s  compute  a  bit  with  s(t): 

s'(t )  +  p(t)s(t )  =  v{t)h'{t)  +  v'(t)h(t)  +  p(t)v(t)h(t ) 

=  v(t)(h'(t)  +  p(t)h(t))  +v'(t)h(t ) 

=  v'(t)h{t). 

The  last  equality  is  true  because  h\t)  +  p(t)h(t)  =  0.  Since  h(t)  is  a  solution  to  the  homogeneous  equation. 
We  are  hoping  to  find  a  function  s(t)  so  that  s'(t)  +  p(t)s(t)  —  /(f);  we  will  have  such  a  function  if  we 
can  arrange  to  have  v'(t)h(t)  —  /(f),  that  is,  v'(t)  —  f(t)/h(t).  But  this  is  as  easy  (or  hard)  as  finding  an 
anti-derivative  of  f(t)/h{t).  Putting  this  all  together,  the  general  solution  to  y  +  p(t)y  —  /(f)  is 

v[t)h{t)  +Aep{t)  =  v[t)ep &  +Aep^. 


Example  10.14:  Solving  an  IVP 


Find  the  solution  of  the  initial  value  problem  y  +3y/t  =  t2,  y(  1 )  =  1/2. 


Solution.  First  we  find  the  general  solution;  since  we  are  interested  in  a  solution  with  a  given  condition  at 
t  —  1,  we  may  assume  f  >  0.  We  start  by  solving  the  homogeneous  equation  as  usual;  call  the  solution  g : 

g  =  Ae~^3/t)dt  =Ae~31nt  =At~3. 

Then  as  in  the  discussion,  h(t)  —  f~3  and  v'(t)  —  t2 /f-3  =  t5,  so  v(f)  =  f6/ 6.  We  know  that  every  solution 
to  the  equation  looks  like 

v(f)f~3  +Ar3  =  *-f~3  +At~ 3  =  l—  +Ar3. 
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Finally  we  substitute  to  find  A: 

1 

2 

A 

The  solution  is  then 


Another  common  method  for  solving  such  a  differential  equation  is  by  means  of  an  integrating  factor. 
In  the  differential  equation  y'  +  p(j)y  =  fit),  we  note  that  if  we  multiply  through  by  a  function  I{l  )  to  get 
I{t)y' +  I{t)p{t)y  =  the  left  hand  side  looks  like  it  could  be  a  derivative  computed  by  the  product 

rule: 

|(/(<  )y)=I(l)y  +  I'(,)y. 

Now  if  we  could  choose  I(t)  so  that  I'(t )  =  I(t)p(t),  this  would  be  exactly  the  left  hand  side  of  the 
differential  equation.  But  this  is  just  a  first  order  homogeneous  linear  equation,  and  we  know  a  solution 

is  I{t)  —  where  Q(t )  =  J  pdt;  note  that  Q{t )  =  —P{t),  where  P(t)  appears  in  the  variation  of 

parameters  method  and  P'(t )  =  —p.  Now  the  modified  differential  equation  is 

e~p^y'  +  e~pVp(t)y  =  e~p^f(t) 

±(e-n.)y)  =  e-mm 

Integrating  both  sides  gives 

e~pVy  =  J  e-p[t)f(t)dt 

y  =  J e~p^f(t)dt. 


i^!+A(l)-3  =  ^+A 

6  6 

1  1  _  1 

2~  6  _  3' 


t  1  -3 

^=6+3'  ■ 


If  you  look  carefully,  you  will  see  that  this  is  exactly  the  same  solution  we  found  by  variation  of  parameters, 
because  e~p^f(t)  —  f(t)/h(t). 

Some  people  find  it  easier  to  remember  how  to  use  the  integrating  factor  method,  rather  than  variation 
of  parameters.  Since  ultimately  they  require  the  same  calculation,  you  should  use  whichever  of  the  two 
methods  appeals  to  you  more  strongly.  Using  this  method,  the  solution  of  the  previous  example  would  look 
just  a  bit  different:  Starting  with  y  +  3 y/t  —  t2,  we  recall  that  the  integrating  factor  is  —  e3lnt  —  t3. 
Then  we  multiply  through  by  the  integrating  factor  and  solve: 


t3y'  +  t33y/t 
t3y'  +  t23y 

j^y) 

t3y 


t6/ 6 
t3/ 6. 


This  is  the  same  answer,  of  course,  and  the  problem  is  then  finished  just  as  before. 
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Exercises  for  10.3 


In  the  following  exercises,  find  the  general  solution  of  the  equation. 

Exercise  10.3.1  y  +  4y  =  8 

Exercise  10.3.2  y'  —  2y  —  6 
Exercise  10.3.3  y'  +  ty  =  5 1 
Exercise  10.3.4  /  +  e'y  —  —2e‘ 

Exercise  10.3.5  y  —  y  —  t2 

Exercise  10.3.6  2 y  +y  —  t 

Exercise  10.3.7  ty  —  2y  =  \/t,  t  >  0 

Exercise  10.3.8  ty'  +  y  =  y/t,  t  >  0 

Exercise  10.3.9  y' cost  +  ysint  —  1,  —  n/2  <t<  n/2 

Exercise  10.3.10  y  +ysect  —  tan  t,  —71/2  <t  <  n/2 


10.4  Approximation 


We  have  seen  how  to  solve  a  restricted  collection  of  differential  equations,  or  more  accurately,  how  to 
attempt  to  solve  them — we  still  may  not  be  able  to  find  the  required  anti-derivatives.  Not  surprisingly, 
non-linear  equations  can  be  even  more  difficult  to  solve.  Yet  much  is  known  about  solutions  to  some  more 
general  equations. 

Suppose  (j)(t,y )  is  a  function  of  two  variables.  A  more  general  class  of  first  order  differential  equations 
has  the  form  /  =  (j)(t,y ).  This  is  not  necessarily  a  linear  first  order  equation,  since  (f>  may  depend  on  y 
in  some  complicated  way;  note  however  that  y'  appears  in  a  very  simple  form.  Under  suitable  conditions 
on  the  function  (f>,  it  can  be  shown  that  every  such  differential  equation  has  a  solution,  and  moreover 
that  for  each  initial  condition  the  associated  initial  value  problem  has  exactly  one  solution.  In  practical 
applications  this  is  obviously  a  very  desirable  property. 


Example  10.15:  First  Order  Non-linear 


The  equation  y  =  t  —  y2  is  a  first  order  non-linear  equation,  because  y  appears  to  the  second  power. 
We  will  not  be  able  to  solve  this  equation. 
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Example  10.16:  Non-linear  and  Separable 


The  equation  y  =  y2  is  also  non-linear,  but  it  is  separable  and  can  be  solved  by  separation  of 
variables. 


Not  all  differential  equations  that  are  important  in  practice  can  be  solved  exactly,  so  techniques  have 
been  developed  to  approximate  solutions.  We  describe  one  such  technique,  Euler’s  Method,  which  is 
simple  though  not  particularly  useful  compared  to  some  more  sophisticated  techniques. 

Suppose  we  wish  to  approximate  a  solution  to  the  initial  value  problem  y  —  (j)(t,y),  y(/o)  =  yo,  for 
t  >  to .  Under  reasonable  conditions  on  (j),  we  know  the  solution  exists,  represented  by  a  curve  in  the  t-y 
plane;  call  this  solution  f(t).  The  point  (to,yo)  is  of  course  on  this  curve.  We  also  know  the  slope  of  the 
curve  at  this  point,  namely  (j)(to,yo).  If  we  follow  the  tangent  line  for  a  brief  distance,  we  arrive  at  a  point 
that  should  be  almost  on  the  graph  of  f(t),  namely  (to+At,yo-\-(j)(to,yo)At);  call  this  point  (ti,yi).  Now  we 
pretend,  in  effect,  that  this  point  really  is  on  the  graph  of  f{t) ,  in  which  case  we  again  know  the  slope  of  the 
curve  through  (t\,y\),  namely  0(fi,yi).  So  we  can  compute  a  new  point,  {t2,yi)  —  (h  +At,yi  +  (j)(ti,yi)At) 
that  is  a  little  farther  along,  still  close  to  the  graph  of  f(t)  but  probably  not  quite  so  close  as  (ti,yi).  We 
can  continue  in  this  way,  doing  a  sequence  of  straightforward  calculations,  until  we  have  an  approximation 
(. tn,yn )  for  whatever  time  tn  we  need.  At  each  step  we  do  essentially  the  same  calculation,  namely: 


(ti+i,yi+i)  =  (ti  +  At,yi  +  (j)(ti,yi)At). 


We  expect  that  smaller  time  steps  At  will  give  better  approximations,  but  of  course  it  will  require  more 
work  to  compute  to  a  specified  time.  It  is  possible  to  compute  a  guaranteed  upper  bound  on  how  far  off  the 
approximation  might  be,  that  is,  how  far  yn  is  from  Suffice  it  to  say  that  the  bound  is  not  particularly 
good  and  that  there  are  other  more  complicated  approximation  techniques  that  do  better. 


Example  10.17:  Approximating  a  Solution 


Compute  an  approximation  to  the  solution  fory'  —  t  —  y2,  y(0)  =  0,  when  t  —  1. 


Solution.  We  will  use  At  —  0.2,  which  is  easy  to  do  even  by  hand,  though  we  should  not  expect  the 
resulting  approximation  to  be  very  good.  We  get 

(tuyi)  =  (0  +  0.2, 0  +  (0  —  02)0.2)  =  (0.2,0) 

[t2,y 2)  =  (0.2  +  0. 2,0+  (0.2  — 02)0.2)  =  (0.4,0.04) 

(t3,y3)  =  (0.6, 0.04  + (0.4  -  0.042)0.2)  =  (0.6,0.11968) 

(t4,y4)  =  (0.8,0.11968  + (0.6-0.119682)0.2)  =  (0.8,0.23681533952) 

(t5,y5)  =  (1.0,0.23681533952  + (0.6 -0.236815339522)0.2)  =  (1.0,0.385599038513605) 

So  y(l)  fn  0.3856.  As  it  turns  out,  this  is  not  accurate  to  even  one  decimal  place.  Figure  10.1  shows  these 
points  connected  by  line  segments  (the  lower  curve)  compared  to  a  solution  obtained  by  a  much  better 
approximation  technique.  Note  that  the  shape  is  approximately  correct  even  though  the  end  points  are 
quite  far  apart. 


10.4.  Approximation 


391 


y 


If  you  need  to  do  Euler’s  method  by  hand,  it  is  useful  to  construct  a  table  to  keep  track  of  the  work, 
as  shown  in  Figure  10.2.  Each  row  holds  the  computation  for  a  single  step:  The  starting  point  (t;-,y;);  the 
stepsize  At;  the  computed  slope  0 the  change  in y,  Ay  =  </>(/,,>’, j A/;  and  the  new  point,  (/,+ 1  ,y/+i )  = 
( ti  +  At.y,  +  Ay) .  The  starting  point  in  each  row  is  the  newly  computed  point  from  the  end  of  the  previous 
row. 


0  ,y) 

At 

$(t,y) 

Ay  =  </>(t,y)At 

(t  +  Af,y  +  Ay) 

(0,0) 

0.2 

0 

0 

(0.2,0) 

(0.2,0) 

0.2 

0.2 

0.04 

(0.4,0.04) 

(0.4,0.04) 

0.2 

0.3984 

0.07968 

(0.6,0.11968) 

(0.6,0.11968) 

0.2 

0.58... 

0.117... 

(0.8,0.236...) 

(0.8,0.236...) 

0.2 

0.743 . . . 

0.148... 

(1.0,0.385...) 

Figure  10.2:  Computing  with  Euler’s  Method. 


X 

Euler’s  method  is  related  to  another  technique  that  can  help  in  understanding  a  differential  equation  in 
a  qualitative  way.  Euler’s  method  is  based  on  the  ability  to  compute  the  slope  of  a  solution  curve  at  any 
point  in  the  plane,  simply  by  computing  If  we  compute  <j)(t,y)  at  many  points,  say  in  a  grid,  and 

plot  a  small  line  segment  with  that  slope  at  the  point,  we  can  get  an  idea  of  how  solution  curves  must  look. 
Such  a  plot  is  called  a  slope  field.  A  slope  field  for  0  =  t  —  y1  is  shown  in  Figure  10.3;  compare  this  to 
figure  10.1.  With  a  little  practice,  one  can  sketch  reasonably  accurate  solution  curves  based  on  the  slope 
field,  in  essence  doing  Euler’s  method  visually. 
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Figure  10.3:  A  slope  field  for  /  =  t—y2 


Even  when  a  differential  equation  can  be  solved  explicitly,  the  slope  field  can  help  in  understanding 
what  the  solutions  look  like  with  various  initial  conditions.  Recall  the  logistic  equation  y'  —  ky(M  —  y): 
y  is  a  population  at  time  t,  M  is  a  measure  of  how  large  a  population  the  environment  can  support,  and  k 
measures  the  reproduction  rate  of  the  population.  Figure  10.4  shows  a  slope  field  for  this  equation  that  is 
quite  informative.  It  is  apparent  that  if  the  initial  population  is  smaller  than  M  it  rises  to  M  over  the  long 
term,  while  if  the  initial  population  is  greater  than  M  it  decreases  to  M. 
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Figure  10.4:  A  slope  field  for  y  =  0.2y(  10  —  y). 


Exercises  for  10.4 


In  the  following  exercises,  compute  the  Euler  approximations  for  the  initial  value  problem  for  0  <  t  <  1 
and  At  =  0.2.  If  you  have  access  to  Sage,  generate  the  slope  field  first  and  attempt  to  sketch  the  solution 
curve.  Then  use  Sage  to  compute  better  approximations  with  smaller  values  of  At. 

Exercise  10.4.1  /  =  t/y,  y(0)  =  1 
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Exercise  10.4.2  y  —  t+y3,  y(0)  —  1 
Exercise  10.4.3  y'  —  cos  (t+y),  y(  0)  =  1 
Exercise  10.4.4  y'  —  tiny,  y(0)  =  2 

10.5  Second  Order  Homogeneous  Equations 


A  second  order  differential  equation  is  one  containing  the  second  derivative  y" .  These  are  in  general 
quite  complicated,  but  one  fairly  simple  type  is  useful:  The  second  order  linear  equation  with  constant 
coefficients. 


Example  10.18:  Second  Order  Homogeneous  Equation 


Analyze  the  intial  value  problem  y"  —  / j  —  2y  —  0,  >’(0)  =  5,  y' (0)  =  0. 


Solution.  We  make  an  inspired  guess:  might  there  be  a  solution  of  the  form  er,r!  This  seems  at  least 
plausible,  since  in  this  case  y",  y' ,  and  y  all  involve  ert . 

If  such  a  function  is  a  solution  then 


r2ert  —  rert  —  2ert  =  0 
ert(r2  —  r  —  2)  =  0 

(r2  —  r  — 2)  =  0 

(r-2)(r+l)  =  0, 

so  r  is  2  or  —  1.  Not  only  are  f  —  e  and  g  —  e  solutions,  but  notice  that  y  =  Af  +  Bg  is  also,  for  any 
constants  A  and  B: 

(Af  +  Bg)"  (Af  +  Bg)'  -  2(Af  +  Bg)  =  Af"  +  Bg"  —  Af'  —  Bg  —  2Af  —  2Bg 

=  A(f"  -  f- 2 f)+B(g"-g'-2g) 

=  A(0)  +B(0)  —  0. 

Can  we  find  A  and  B  so  that  this  is  a  solution  to  the  initial  value  problem?  Let’s  substitute: 

5  =  y(0)  =  A/(0)  +  5g(0)  =  Ae°  +  Be0  —A  +  B 

and 

0  =  /(0)  =Af'(0)+Bg\0)  =A2e°  +  B(-l)e°  =  2A-B. 

So  we  need  to  solve  this  system  of  two  equations  with  two  unknowns: 

f  A  +  B  —  5 
{  2A-B  =0 

Let  B  —  2 A,  substitute  into  the  first  equation  to  get  5  =  A  +  2A  =  3A.  Then  A  =  5/3  and  B  —  10/3,  and  the 
desired  solution  is  (5/3)e2?  +  (10/3)^.  You  now  see  why  the  initial  condition  in  this  case  included  both 
y(0)  and  / (0) :  We  needed  two  equations  in  the  two  unknowns  A  and  B  Jt» 
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You  should  of  course  wonder  whether  there  might  be  other  solutions,  but  as  it  turns  out,  the  answer  is 
no.  We  will  not  prove  this,  but  here  is  the  theorem  that  tells  us  what  we  need  to  know: 


Theorem  10.19:  Solutions  to  Second  Order  Homogeneous 


Given  the  differential  equation  ay"  +  by1  +  cy  =  0,  a  /  0,  consider  the  quadratic  polynomial  ar2  + 
br  +  c  —  0,  called  the  characteristic  polynomial.  Using  the  quadratic  formula,  this  polynomial 
always  has  one  or  two  roots,  call  them  r\  and  ri-  The  general  solution  of  the  differential  equation 
is: 

(a)  y  —  Ae1'1  +  Ber2‘ ,  if  the  roots  r\  and  ri_  are  real  numbers,  and  r\  /  z  ?. 

(b)  y  —  Aent  +  Bte'2t,  if  r\  =  /'2  is  a  real,  repeated  root. 

(c)  y  —  Acos(f5t)eat  +  B  sin(l5t)eat ,  if  the  roots  are  complex  numbers,  r=a±{5i. 


Example  10.20: 


Solve  the  differential  equation  y"  +A2y  —  0. 


Solution.  First  we  write  the  characteristic  equation,  r2  +A2  =  0.  Then  we  find  the  roots  of  the  characteristic 
equation: 

r2  =  -A2  =>  r=±Ai 

These  are  imaginary  roots,  so  the  solution  of  the  differential  equation  is  in  the  form: 

y  —  c\  cos  (At)  +  C2  sin  (At) 


X 


Example  10.21: 


Solve  the  differential  equation  /'  —  A2y  —  0. 

Solution.  First  we  write  the  characteristic  equation,  r 2  —A2  =  0.  Then  we  find  the  roots  of  the  characteristic 
equation: 

r2  —  A2  =►  r  =  ±A 

These  are  imaginary  roots,  so  the  solution  of  the  differential  equation  is  in  the  form: 

y  =  cleA>  +  c2e-A'=c,eA>  +  ^7 

* 


Example  10.22:  Damped  Spring  Oscillation 


Use  a  differential  equation  to  describe  the  position  of  a  mass  hung  on  a  spring. 
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Solution.  Suppose  a  mass  m  is  hung  on  a  spring  with  spring  constant  k.  If  the  spring  is  compressed  or 
stretched  and  then  released,  the  mass  will  oscillate  up  and  down.  Due  to  friction,  the  oscillation  will 
be  damped:  Eventually  the  motion  will  cease.  The  damping  will  depend  on  the  amount  of  friction;  for 
example,  if  the  system  is  suspended  in  oil  the  motion  will  cease  sooner  than  if  the  system  is  in  air.  Using 
some  simple  physics,  it  is  not  hard  to  see  that  the  position  of  the  mass  is  described  by  the  differential 
equation:  my"  +  by'  +  ky  =  0.  Using  m  =  1,  b  =  4,  and  k  —  5  we  find  the  motion  of  the  mass.  The 
characteristic  polynomial  is  r2  +  4r  +  5  =  0,  with  roots  r  —  ( —4  ±  \/T6^20) /2  =  —  2  ±  i.  Thus  the  general 
solution  is  y  —  Acos(t)e~2t  A- Bsm(t)e~2r .  Suppose  we  know  thaty(O)  =  1  andy'(O)  —  2.  Then  as  before 
we  form  two  simultaneous  equations:  From  y(0)  =  1  we  get  1  =  Acos(0)c°  +  5sin(0)c°  =  A.  For  the 
second  we  compute 

y"  —  —  2Ae~2t  cos(t)  +Ae~2t(—  sin(f))  —  2Be~2t  sin(f)  +Be~2t  cos(t), 

and  then 

2  =  —  2A<?°cos(0)  — Ac°sin(0)  — 2i?£’°sin(0)  +5<?°cos(0)  =  — 2A+5. 

So  we  get  A  =  1,  B  =  4,  and  y  =  cos(t)e~2r  +  4sin(t)e~2t. 

Here  is  a  useful  trick  that  makes  this  easier  to  understand:  We  have  y  —  (cost  +  4 sin t)e~2t .  The 
expression  cost  +  4sint  is  a  bit  reminiscent  of  the  trigonometric  formula  cos(a  —  /3)  =  cos(a)  cos(/3)  + 
sin (ce)  sin(/3)  with  a  —  t.  Let’s  rewrite  it  a  bit  as 

r—  (  1  4 

V17  -=cost-\ — =sint 
Vv^  VT7 

Note  that  (l/\/l7)2  +  (4/\/T7)2  =  1,  which  means  that  there  is  an  angle  /3  with  cos/3  =  1/vT7  and 
sin/3  =  4/ y/ll  (of  course,  /3  may  not  be  a  “nice”  angle).  Then 

cost  +  4sint  =  \/l7  (costcos/3  +  sin/3  sint)  =  \/l7cos(t  — /3). 

Thus,  the  solution  may  also  be  written  y  —  \f\le~2t  cos  (t  —  /3).  This  is  a  cosine  curve  that  has  been  shifted 
/3  to  the  right;  the  \/T7e_2f  has  the  effect  of  diminishing  the  amplitude  of  the  cosine  as  t  increases.  X 

Other  physical  systems  that  oscillate  can  also  be  described  by  such  differential  equations.  Some  elec¬ 
tric  circuits,  for  example,  generate  oscillating  current. 


Example  10.23: 


Find  the  solution  to  the  intial  value  problem  y"  —  Ay  +  4y  =  0,  y(0)  =  —3,  /(0)  =  1 . 


Solution.  The  characteristic  polynomial  is  r2  —  4r  +  4  =  (r  —  2)2,  so  there  is  one  root,  r  =  2,  and  the 
general  solution  is  Ae2t  +  Bte2' .  Substituting  t  —  0  we  get  —3  =  A  +  0  =  A.  The  first  derivative  is  2 Ae2’  + 
2 Bte2'  +Be2t\  substituting  t  —  0  gives  1  —  2A  +  0  +  B  —  2A  +  B  =  2(— 3)  +  B  —  —6  +  B,  so  B  —  7.  The 
solution  is  —  3e2t  +  lte2t .  X 
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Exercises  for  10.5 


Exercise  10.5.1  Solve  the  initial  value  problem  y"  —  a>2y  —  0,  >’(0)  =  1,  y' (0)  =  1,  assuming  ft)  /  0. 

Exercise  10.5.2  Solve  the  initial  value  problem  2 y"  +  18_y  =  0,  >’(0)  =  2,  y'  (0)  =  15. 

Exercise  10.5.3  Solve  the  initial  value  problem  y"  +  6 /  +  5  y  =  0,  >’(())  =  1,  /( 0)  =  0. 

Exercise  10.5.4  Solve  the  initial  value  problem  y"  —  /  —  12 y  =  0,  _y(0)  =  0,  /( 0)  =  14. 

Exercise  10.5.5  Solve  the  initial  value  problem  y"  + 12/  +  36  v  =  0,  _y(0)  =  5,  /(0)  =  — 10. 

Exercise  10.5.6  Solve  the  initial  value  problem  y"  —  8/  +  16y  =  0,  y(0)  =  —3,  /(0)  =  4. 

Exercise  10.5.7  Solve  the  initial  value  problem  y"  +  5y  =  0,  y(0)  =  —2,  /(0)  =  5. 

Exercise  10.5.8  Solve  the  initial  value  problem  y"  +y  =  0,  _y( 7t/4)  =  0,  y(n/ 4)  =  2. 

Exercise  10.5.9  Solve  the  initial  value  problem  y"  + 12/  +  37  y  =  0,  _y(0)  =  4,  /(0)  =  0. 

Exercise  10.5.10  Solve  the  initial  value  problem  y"  +  6 /  +  1 8  v  =  0,  y(0)  =  0,  /(0)  =  6. 

Exercise  10.5.11  Solve  the  initial  value  problem  y"  +  4_y  =  0,  _y(0)  =  a/3,  /(0)  =  2. 

Exercise  10.5.12  Solve  the  initial  value  problem  y"  +  lOOy  =  0,  y(0)  =  5,  /(0)  =  50. 

Exercise  10.5.13  Solve  the  initial  value  problem  y"  +  Ay  +  13y  =  0,  _y(0)  =  1,  /(0)  =  1. 

Exercise  10.5.14  Solve  the  initial  value  problem  y"  —  8 /  +  25  y  =  0,  y(0)  =  3,  /(0)  =  0. 

Exercise  10.5.15  A  mass-spring  system  my"  +  by  +kx  has  k  —  29,  b  —  4,  and  m  —  1.  At  time  t  —  0  the 
position  is  _y(0)  =  2  and  the  velocity  is  /( 0)  =  1.  Find y(t  ). 

Exercise  10.5.16  A  mass-spring  system  my"  +  by  +  kx  has  k  —  24,  b  =  12,  and  m  —  3.  At  time  t  —  0  the 
position  is  y( 0)  =  0  and  the  velocity  is  /( 0)  =  —  1.  Find  y(t). 

Exercise  10.5.17  Consider  the  differential  equation  ay"  A- by'  —  0,  with  a  and  b  both  non-zero.  Find  the 
general  solution  by  the  method  of  this  section.  Now  let  g  =  /;  the  equation  may  be  written  as  ag'  +  bg  =  0, 
a  first  order  linear  homogeneous  equation.  Solve  this  for  g,  then  use  the  relationship  g  —  y  to  find  y. 

Exercise  10.5.18  Suppose  that  y (t)  is  a  solution  to  ay"  +  by'  +  cy  —  0,  y(fi>)  —  0,  /(to)  =  0.  Show  that 

y(0  =  0- 
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10.6  Second  Order  Linear  Equations  -  Method  of  Unde¬ 
termined  Coefficients 


Now  we  consider  second  order  equations  of  the  form  ay"  +  by  +  cy  =  f(t),  with  a,  b,  and  c  constant.  Of 
course,  if  a  —  0  this  is  really  a  first  order  equation,  so  we  assume  a^O.  Also,  if  c  —  0  we  can  solve  the 
related  first  order  equation  ah'  +  bh  —  f(t),  and  then  solve  h  =  y'  for  y.  So  we  will  only  examine  examples 
in  which  0. 

Suppose  that  yi(t)  and  >’2(7)  are  solutions  to  ay" +  by' +cy  —  f(t),  and  consider  the  function  h  =  y\  —  y2. 
We  substitute  this  function  into  the  left  hand  side  of  the  differential  equation  and  simplify: 

a{y\  -yi)"  +  Hyi  -yiY  +  c(y\  ->’2)  =  ay'l  +  by\  +  cy\  -  (ay"  +  by'2  +  cy2 )  =  f(t)  -  f(t )  =  0. 

So  h  is  a  solution  to  the  homogeneous  equation  ay"  +  by  +  cy  —  0.  Since  we  know  how  to  find  all  such 
h,  then  with  just  one  particular  solution  y2  we  can  express  all  possible  solutions  yi,  namely,  y  1  —  h  +  y2, 
where  now  h  is  the  general  solution  to  the  homogeneous  equation.  Of  course,  this  is  exactly  how  we 
approached  the  first  order  linear  equation. 

To  make  use  of  this  observation  we  need  a  method  to  find  a  single  solution  y2.  This  turns  out  to 
be  somewhat  more  difficult  than  the  first  order  case,  but  if  f(t)  is  of  a  certain  simple  form,  we  can  find  a 
solution  using  the  method  of  undetermined  coefficients,  sometimes  more  whimsically  called  the  method 
of  judicious  guessing. 


Example  10.24:  Second  Order  Linear  Equation 


Solve  the  differential  equation  y"  —  y1  -  6  v  —  1 8/2  +  5. 


'iu.  _ r\. 

Solution.  The  general  solution  of  the  homogeneous  equation  is  Ae  +  Be  .  We  guess  that  a  solution  to 
the  non-homogeneous  equation  might  look  like  f(t)  itself,  namely,  a  quadratic  y  =  at2  +  bt  +  c.  Substitut¬ 
ing  this  guess  into  the  differential  equation  we  get 

y"  — /  —  6y  —  2a  —  (2  at  +  b)  —  6  (at2  +  bt  +  c)  —  —hat2  +  (—2a  —  6  b)t  +  (2a  —  b  —  6c). 

We  want  this  to  equal  18t2  +  5,  so  we  need 

—6a  —  18 

—2a  —  6b  =  0 
2a  —  b  —  6c  —  5 

This  is  a  system  of  three  equations  in  three  unknowns  and  is  not  hard  to  solve:  a  —  —3,  b  —  l,  c  —  —2. 
Thus  the  general  solution  to  the  differential  equation  is  Ae3t  +  Be  2t  —  3t2  +  t  —  2.  X 

So  the  “judicious  guess”  is  a  function  with  the  same  form  as  f(t)  but  with  undetermined  (or  better,  yet 
to  be  determined)  coefficients.  This  works  whenever  f(t)  is  a  polynomial. 


Example  10.25:  Mass-Spring  System  with  No  Damping 


Analyze  the  initial  value  problem  my"  +  ky  —  —mg,  y(0)  =  2,  /(())  =  50. 
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Solution.  The  left  hand  side  represents  a  mass-spring  system  with  no  damping,  i.e.,  b  =  0.  Unlike  the 
homogeneous  case,  we  now  consider  the  force  due  to  gravity,  —mg,  assuming  the  spring  is  vertical  at 
the  surface  of  the  earth,  so  that  g  —  980.  To  be  specific,  let  us  take  m  —  1  and  k  —  100.  The  general 
solution  to  the  homogeneous  equation  is  Acos(lOt)  +5sin(10/).  For  the  solution  to  the  non-homogeneous 
equation  we  guess  simply  a  constant  y  —  a,  since  —mg  —  —980  is  a  constant.  Then  y"  +  lOOy  =  100n  so 
a  —  —980/100  =  —9.8.  The  desired  general  solution  is  then  Acos(lOt)  +5sin(10t)  —9.8.  Substituting 
the  initial  conditions  we  get 


2  =  A -9.8 
50  =  10  B 

so  A  =  11.8  and  B  —  5  and  the  solution  is  11.8cos(10t)  +  5  sin(10f)  —  9.8.  4k 

More  generally,  this  method  can  be  used  when  a  function  similar  to  f{t)  has  derivatives  that  are  also 
similar  to  f(t);  in  the  examples  so  far,  since  /(/)  was  a  polynomial,  so  were  its  derivatives.  The  method 
will  work  if  f(t)  has  the  form  p(t)eat cos (/3t)  +  q(t)eat sin(/3t),  where  p{t)  and  q(t)  are  polynomials; 
when  a  =  /3  =  0  this  is  simply  p(t),  a  polynomial.  In  the  most  general  form  it  is  not  simple  to  describe 
the  appropriate  judicious  guess;  we  content  ourselves  with  some  examples  to  illustrate  the  process. 


Solution.  The  characteristic  equation  is  r2  +  7r  +  10  =  (r  +  5)(r  +  2),  so  the  solution  to  the  homoge¬ 
neous  equation  is  Ae~5f  +  Be~2t .  For  a  particular  solution  to  the  inhomogeneous  equation  we  guess  Ce3t. 
Substituting  we  get 

9  Ce3t  +  21  Ce3t  +  10  Ce3t  =  e3?40C. 

When  C  =  1/40  this  is  equal  to  f(t)  —  e3t,  so  the  solution  is  Ae~5t  +Be~2t  +  (l/40)e3r.  4k 


Solution.  Following  the  last  example  we  might  guess  Ce  2t ,  but  since  this  is  a  solution  to  the  homogeneous 
equation  it  cannot  work.  Instead  we  guess  Cte~2t .  Then 

{-2Ce~2t  -  2Ce~2t  +  4Cte~2t)  +  7 (Ce~2t  -  2Cte~2t )  +  10 Cte~2t  =  e~2t(-3C). 

Then  C  —  — 1/3  and  the  solution  is  Ae~5t  +Be~2t  —  (1/3 )te~2t .  4k 

In  general,  if  f{t)  —  ekt  and  k  is  one  of  the  roots  of  the  characteristic  equation,  then  we  guess  Cl ekt 
instead  of  Cekt .  If  k  is  the  only  root  of  the  characteristic  equation,  then  Cl ekI  will  also  not  work,  so  we 
must  guess  Ct2ekt . 
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Solution.  The  characteristic  equation  is  r2  —  6r  +  9  =  (r  —  3)2,  so  the  general  solution  to  the  homogeneous 
equation  is  Ae3t  +  Bte3' .  Guessing  Ct2e3t  for  the  particular  solution,  we  get 

(9Ct2e3t  +  6 Cte3t  +  6 Cte3t  +  2 Ce3t)  -  6(30  +  2 Cte3t)  +  90 V  =  e3t2C. 

Thus,  the  solution  is  Ae3t  +  Bte3t  +  (\  /  2)  t2e3t .  X 

It  is  common  in  various  physical  systems  to  encounter  an  f(t)  of  the  form  acos((Ot)  +&sin(ft)t). 


Example  10.29:  Solving  a  Second  Order  Linear  Equation 


Find  the  general  solution  to  y"  +  6 /  +  25y  =  cos(4/). 


Solution.  The  roots  of  the  characteristic  equation  are  —  3  ±4z,  so  the  solution  to  the  homogeneous  equation 
is  e_3?(Acos(4t)  +5sin(4/)).  For  a  particular  solution,  we  guess  Ccos(4t)  +Dsin(4t).  Substituting  as 
usual: 


(— 16Ccos(4t)  H — 16Dsin(4t))  +  6(— 4Csin(4t)  +4Dcos(4t))  +  25(Ccos(4t)  +Dsin(4t)) 

=  (24D  +  9C)  cos(4t)  +  (-24C  +  9D)  sin(4t). 

To  make  this  equal  to  cos(4t)  we  need 


24D  +  9C  =  1 
9D-24C  =  0 

which  gives  C  =  1/73  and  0  =  8/219.  The  full  solution  is  then  e~3t  (Acos(4t)  +  5sin(4/))  +  ( 1/73)  cos (4/)  + 
(8/219)  sin(4f). 

The  function  e“3?(Acos(4t)  +/?sin(4t))  is  a  damped  oscillation  as  in  example  10.25,  while  (1/73)  cos(4f)  + 
(8/219)  sin(4t)  is  a  simple  undamped  oscillation.  As  t  increases,  the  sum  e~3?(Acos(4r)  +5sin(4t))  ap¬ 
proaches  zero,  so  the  solution 

e~3f(Acos(4t)  +Z?sin(4t))  +  (l/73)cos(4t)  +  (8/219)  sin(4t) 

becomes  more  and  more  like  the  simple  oscillation  (1/73)  cos(4?)  +  (8/219)  sin(4t) — notice  that  the  initial 
conditions  don’t  matter  to  this  long  term  behavior.  The  damped  portion  is  called  the  transient  [part  of  the] 
solution,  and  the  simple  oscillation  is  called  the  steady  state  [part  of  the]  solution.  A  physical  example 
is  a  mass-spring  system.  If  the  only  force  on  the  mass  is  due  to  the  spring,  then  the  behavior  of  the  system 
is  a  damped  oscillation.  If  in  addition  an  external  force  is  applied  to  the  mass,  and  if  the  force  varies 
according  to  a  function  of  the  form  acos(ow)  +  /?sin(ft>/),  then  the  long  term  behavior  will  be  a  simple 
oscillation  determined  by  the  steady  state  portion  of  the  general  solution;  the  initial  position  of  the  mass 
will  not  matter.  X 

As  with  the  exponential  form,  such  a  simple  guess  may  not  work. 


Example  10.30:  Solving  a  Second  Order  Linear  Equation 


Find  the  general  solution  to  y"  +  16y  =  —  sin (4/). 
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Solution.  The  roots  of  the  characteristic  equation  are  ±4/,  so  the  solution  to  the  homogeneous  equa¬ 
tion  is  Acos(4t)  +  B sin(4t) .  Since  both  cos(4f)  and  sin(4f)  are  solutions  to  the  homogeneous  equation, 
Ccos(4t)  +  Dsin(4t)  is  also,  so  it  cannot  be  a  solution  to  the  non-homogeneous  equation.  Instead,  we 
guess  CYcos(4t)  +  Z)f  sin(4t).  Then  substituting: 

(  —  16Ctcos(4t)  —  16Dsin(4t)  +  8Z)cos(4t)  —  8Csin(4t)))  +  16(C7cos(4t)  +  Df  sin(4t)) 

=  8Z)cos(4t)  —  8Csin(4t). 

Thus  C  —  l/S,D  —  0,  and  the  solution  is  Ccos(4t)  +  Dsin(4t)  +  (l/8)tcos(4f).  X 

In  general,  if  /(f)  —  acos(cot)  +bsin(cot),  and  ±(oi  are  the  roots  of  the  characteristic  equation,  then 
instead  of  Ccos((0f)  +Dsin(cot)  we  guess  Ctcos(cot)  +Dtsin(cot). 


Exercises  for  10.6 


Find  the  general  solution  to  the  differential  equation. 
Exercise  10.6.1  y"  —  10/  +  25  y  —  cos  t 

Exercise  10.6.2  y"  +  2/2/  +  2_y  =  10 

Exercise  10.6.3  y"  +  I6y  —  St2  +  3t  —  4 

Exercise  10.6.4  y"  +  2y  —  cos(5 1)  +  sin(5t) 

Exercise  10.6.5  y"  —  2 /  +  2 y  —  e2t 

Exercise  10.6.6  y"  —  6y  +  13  =  1  +  2 t  +  e~T 

Exercise  10.6.7  /'  +  /  -  6y  =  e  3; 

Exercise  10.6.8  y"  —  4/  +  3y  —  e2t 

Exercise  10.6.9  y"  +16 y  —  cos (4 1) 

Exercise  10.6.10  y"  +  9y  —  3  sin(3f ) 

Exercise  10.6.11  y"  +  12/  +  36  y  =  6e~6t 

Exercise  10.6.12  y" -  8/  +  16y  =  -2eAt 

Exercise  10.6.13  y"  +  6/  +  5y  =  4 

Exercise  10.6.14  /'  —  /  —12 y  —  t 
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Exercise  10.6.15  y"  +  5y  —  8  sin(2/) 

Exercise  10.6.16  y"  -4 y  =  4e2t 


Solve  the  initial  value  problem. 

Exercise  10.6.17  y"-y  =  3/ +  5,  y(0)  =  0,  /(0)  -  0 

Exercise  10.6.18  y"  +  9 y  =  4 1,  y(0)  =  0,  / (0)  =  0 

Exercise  10.6.19  y"  +  12/  +  37y  =  I0e~4t,  y(0)  =  4,  /( 0)  =  0 

Exercise  10.6.20  y"  +  6/  +  18y  =  cost  —  sin/,  y(0)  =  0,  /( 0)  =  2 

Exercise  10.6.21  Find  the  solution  for  the  mass-spring  equation  y"  +  4 /  +  29  y  —  689cos(2/). 

Exercise  10.6.22  Find  the  solution  for  the  mass-spring  equation  3  y"  +  12/  +  24y  =  2  sin/. 

Exercise  10.6.23  Consider  the  differential  equation  my "  +  by  +ky  —  cos  (tot),  with  m,  b,  and  k  all  positive 
and  b  <  2 ink;  this  equation  is  a  model  for  a  damped  mass-spring  system  with  external  driving  force 
cos  (ft)/).  Show  that  the  steady  state  part  of  the  solution  has  amplitude 

1 


\J  (k  —  m(0 2)2  +  C02b 2 
/  4n)^  —  2  b2 

Show  that  this  amplitude  is  largest  when  ft)  = - - - .  This  is  the  resonant  frequency  of  the  system. 

2m 


10.7  Second  Order  Linear  Equations  - 
Variation  of  Parameters 


The  method  of  the  last  section  works  only  when  the  function  /(/)  in  ay"  +  by  +  cy  —  f(t)  has  a  particularly 
nice  form,  namely,  when  the  derivatives  of  /  look  much  like  /  itself.  In  other  cases  we  can  try  variation 
of  parameters  as  we  did  in  the  first  order  case. 

Since  as  before  a  /  0,  we  can  always  divide  by  a  to  make  the  coefficient  of  y"  equal  to  1.  Thus,  to 
simplify  the  discussion,  we  assume  a  =  1.  We  know  that  the  differential  equation  y"  +  by'  +  cy  —  0  has  a 
general  solution  y  =  Ay  \  +  By 2-  As  before,  we  guess  a  particular  solution  to  y"  +  by  +  cy  —  /(/);  this  time 
we  use  the  guess  y  —  u(t)y\  +  v(t)y 2.  Compute  the  derivatives: 

/  =  u'y  1  +  uy[  +  v'y2  +  vy2 

y"  =  u"y\  +  uy\  +  u'y\  +  uy"  +  v"y2  +  v'y'2  +  v'y'2  +  vy". 

Now  substituting: 

y"  +  by  +  cy  =  u"y\  +  uy\  +  uy\  +  uy"  +  v"y2  +  v'y'2  +  v'y'2  +  vy" 
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+bu'y  i  +  buy)  +  bv'y2  +  bvy2  +  cuy  \  +  cvy2 
=  ( uy'l  +  buy\+ cuy  1 )  +  ( vy2  +  bvy'2  +  cvy2 ) 

+b(u'yi  +  v'yf)  +  {u"y\  +  uy\  +  v"y2  +  v'y2)  +  (uy\  +  v'y2 ) 

=  0  +  0  +  b(uyi  +  vy2 )  +  {u”y\  +  uy\  +  v"  y2  +  vy2 )  +  ( uy\  +  vy2 ) . 

The  first  two  terms  in  parentheses  are  zero  because  and  y2  are  solutions  to  the  associated  homogeneous 
equation.  Now  we  engage  in  some  wishful  thinking.  If  u!y\  +  v'y2  =  0,  then  we  also  have  u" y  \  +  u'y\  + 
v"y2  +  v'y'2  =  0  by  taking  derivatives  of  both  sides.  This  reduces  the  entire  expression  to  u'y\  +  vy2  —  0. 
We  want  this  to  be  f(t),  that  is,  we  need  u'y\  +  vy2  —  f(t).  So  we  would  very  much  like  these  equations 
to  be  true: 

uy  i  +  v'y2  =  0 
uy\  +  vy2  =  f{t). 

This  is  a  system  of  two  equations  in  the  two  unknowns  u'  and  v  ,  so  we  can  solve  as  usual  to  get  u'  —  g{t ) 
and  v'  —  h{t).  Then  we  can  find  u  and  v  by  computing  antiderivatives.  This  is  of  course  the  sticking  point 
in  the  whole  plan,  since  the  antiderivatives  may  be  impossible  to  find.  Nevertheless,  this  sometimes  works 
out  and  is  worth  a  try. 


Example  10.31:  Variation  of  Parameters 


Consider  the  equation  y"  —  5/  +  6y  =  sin  t.  Solve  using  variation  of  parameters. 


It 

Solution.  The  solution  to  the  homogeneous  equation  is  Ae  +  Be  ,  so  the  simultaneous  equations  to  be 
solved  are 


u'e2t  +  vre3t  —  0 
2ur  e2t  +  3vr  e3t  —  sint. 


If  we  multiply  the  first  equation  by  2  and  subtract  it  from  the  second  equation  we  get 


v'e3t  —  sint 
v  —  e-3/sin  t 


v  =  —  ^(3sint  +  cost)e  3f, 


using  integration  by  parts.  Then  from  the  first  equation: 

-It  —3 1 


u  =  -e  2  v  e3  =  —e  e  3  sin (t)<?3  = -e  2  sint 

1 

u  —  -(2sint +  cost)e  . 


Now  the  particular  solution  we  seek  is 


=  ^(2sint  +  cost)e  2t e2t  —  ^(3sint  +  cost)e  3te3t 

—  ^(2sint  +  cost)  —  ^(3sint  +  cost) 

1  .  .  , 

=  — (sint  +  cost), 


ue2t  +  ve3t 
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and  the  solution  to  the  differential  equation  is  Ae2t  +  Be31  +  (sin t  +  cos  t) / 10.  For  comparison  (and  prac¬ 
tice)  you  might  want  to  solve  this  using  the  method  of  undetermined  coefficients — both  techniques  should 
yield  the  same  result.  X 


Example  10.32:  Variation  of  Parameters 


The  differential  equation  y"  —  5/  +  6y  =  e‘  sin/  can  be  solved  using  the  method  of  undetermined 
coefhcients,  though  we  have  not  seen  any  examples  of  such  a  solution.  Again,  we  will  solve  it  by 
variation  of  parameters. 


Solution  The  equations  to  be  solved  are 


/  2t  ,  /  3t  n 

u  e  +ve  —  0 

2  u'e2t  +  3v'e3t  —  e*  sin  t. 

If  we  multiply  the  first  equation  by  2  and  subtract  it  from  the  second  equation  we  get 

v’e3t  —  e1  sin  t 

v  —  e~3V  sin/  =  e~2t  sin/ 

1  o, 

v  =  — -(2sin/  +  cos/)<? 


Then  substituting  we  get 


u  —  —  e  2tv'e3t  —  —e  2te  2fsin {t)e3t  — —e  fsin t 
u  —  -(sint  +  cost)e”?. 


The  particular  solution  is 


ue2t  +  v<?3f  = 


^(sint-|-cost)e“fe2f  —  ^(2sint  +  cost)e~2t  e3t 
^(sint-|-cost)ef  —  ^(2sin/  +  cost)ef 
^(sin/-|-3cost)ef, 


and  the  solution  to  the  differential  equation  is  Ae2t  +  BeM  +  / (sin/  +  3  cos /) / 10. 


Jit  I  J  i 


X 


Example  10.33:  Solving  a  DE 


The  differential  equation  y"  —  2 y  +  _y  =  el  f  t2  is  not  of  the  form  amenable  to  the  method  of  undeter¬ 
mined  coefficients.  Solve  it  using  variation  of  parameters. 


Solution.  The  solution  to  the  homogeneous  equation  is  Ael  +  Ble1  and  so  the  simultaneous  equations  are 

u1 el  +  v  tel  —  0 

mV  +  v'te1  +  v  el  —  . 

t2 
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Subtracting  the  equations  gives 


Then  substituting  we  get 


/ 

v  = 

v  = 


t2 

1 


/ 

u 

u 


—vte*  —  — -p:tJ 
t2 

1 

t 

—  lnt. 


The  solution  is  Ae 1  +Bte(  —  e1  Inf  —  er . 


X 


Exercises  for  10.7 

Find  the  general  solution  to  the  differential  equation  using  variation  of  parameters. 

Exercise  10.7.1  y"  +  y  =  tanx 

Exercise  10.7.2  /'  +y  =  e2‘ 

Exercise  10.7.3  y"  +  4v  =  secx 

Exercise  10.7.4  y"  +  4 y  —  tanx 

Exercise  10.7.5  y”  +  y  -6 y  =  t2e2t 

Exercise  10.7.6  y"  —  2/  +  2 y  —  el  tan(t) 

Exercise  10.7.7  y"  —  2  y  +  2 y  —  si  n  (/ )  cos(t)  (This  is  rather  messy  when  done  by  variation  of  parameters; 
compare  to  undetermined  coefficients.) 


11.  Polar  Coordinates,  Parametric  Equations 


11.1  Polar  Coordinates 


Coordinate  systems  are  tools  that  let  us  use  algebraic  methods  to  understand  geometry.  While  the  rectan¬ 
gular  (also  called  Cartesian)  coordinates  that  we  have  been  using  are  the  most  common,  some  problems 
are  easier  to  analyze  in  alternate  coordinate  systems. 

A  coordinate  system  is  a  scheme  that  allows  us  to  identify  any  point  in  the  plane  or  in  three-dimensional 
space  by  a  set  of  numbers.  In  rectangular  coordinates  these  numbers  are  interpreted,  roughly  speaking,  as 
the  lengths  of  the  sides  of  a  rectangle.  In  polar  coordinates  a  point  in  the  plane  is  identified  by  a  pair  of 
numbers  (r,  0).  The  number  0  measures  the  angle  between  the  positive  jc-axis  and  a  ray  that  goes  through 
the  point,  as  shown  in  Figure  12.15;  the  number  r  measures  the  distance  from  the  origin  to  the  point  (the 
radius).  Figure  12.15  shows  the  point  with  rectangular  coordinates  (l,x/3)  and  polar  coordinates  (2,  tt/3 ) , 
2  units  from  the  origin  and  k/3  radians  from  the  positive  ;t-axis. 


(2, ft/3) 


Figure  11.1:  Polar  coordinates  of  the  point  (1,  \/3). 


Just  as  we  describe  curves  in  the  plane  using  equations  involving  x  and  y,  so  can  we  describe  curves 
using  equations  involving  r  and  0.  Most  common  are  equations  of  the  form  r  =  f  (6). 


Solution.  All  points  with  r  —  2  are  at  distance  2  from  the  origin,  so  r  —  2  describes  the  circle  of  radius  2 
with  center  at  the  origin.  Jl» 
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Solution.  We  first  consider  y  =  1  +  cosx,  as  in  figure  11.2.  As  0  goes  through  the  values  in  [0, 2tt] ,  the 
value  of  r  tracks  the  value  of  y,  forming  the  “cardioid”  shape  of  figure  11.2.  For  example,  when  0  —  k/2, 
r  =  1  +  cos(7t/2)  =  1,  so  we  graph  the  point  at  distance  1  from  the  origin  along  the  positive  y-axis,  which 
is  at  an  angle  of  k/  2  from  the  positive  x-axis.  When  0  =  Ik /4,  r  =  1  +  cos(7;r/4)  =  1  +  \/2/2  ps  1.71, 
and  the  corresponding  point  appears  in  the  fourth  quadrant.  If  we  look  at  the  curve,  we  should  explain  the 
indentation  which  occurs  at  6  —  k.  It  should  be  obvious  that  cosx  <  0  on  (k/2,  3k/ 2),  thus  r  =  1  +cos0  < 
1 .  Specifically,  when  0  —  K,  cos  k  =  —  1 ,  so  r  =  0,  and  we  get  the  resulting  indentation.  This  illustrates  one 
of  the  potential  benefits  of  using  polar  coordinates:  the  equation  for  this  curve  in  rectangular  coordinates 
would  be  quite  complicated.  & 


Figure  11.2:  A  cardioid:  y  =  1  +  cosx  on  the  left,  r  =  1  +  cos  0  on  the  right. 


Each  point  in  the  plane  is  associated  with  exactly  one  pair  of  numbers  in  the  rectangular  coordinate 
system;  each  point  is  associated  with  an  infinite  number  of  pairs  in  polar  coordinates.  In  the  cardioid 
example,  we  considered  only  the  range  0  <  6  <  2k,  and  already  there  was  a  duplicate:  (2, 0)  and  (2,  2k) 
are  the  same  point.  Indeed,  every  value  of  0  outside  the  interval  [0,  2k)  duplicates  a  point  on  the  curve 
r  —  1  +cos  0  when  0  <  0  <  2k.  We  can  even  make  sense  of  polar  coordinates  like  (— 2,  tt/4):  Go  to  the 
direction  k/A  and  then  move  a  distance  2  in  the  opposite  direction;  see  figure  11.3.  As  usual,  a  negative 
angle  0  means  an  angle  measured  clockwise  from  the  positive  x-axis.  The  point  in  figure  11.3  also  has 
coordinates  {2,5k /A)  and  (2,  — 3tt/4). 


'k/A 


-2  -1/ 
/-\ 

* 


1  2 


Figure  11.3:  The  point  {—2, k/A)  =  {2,5k /A)  =  (2,  — 3tt/4)  in  polar  coordinates. 


The  relationship  between  rectangular  and  polar  coordinates  is  quite  easy  to  understand.  The  point  with 
polar  coordinates  (r,  0)  has  rectangular  coordinates  x  =  rcos  0  and  y  =  rsin  0;  this  follows  immediately 
from  the  definition  of  the  sine  and  cosine  functions.  Using  figure  1 1.3  as  an  example,  the  point  shown  has 
rectangular  coordinates  x  =  (— 2)cos(?r/4)  =  —  V2&  1.4142  and  y—  (— 2)  sin(7r/4)  =  —  \p2.  This  makes 
it  very  easy  to  convert  equations  from  rectangular  to  polar  coordinates. 
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Example  11.3:  Straight  Line  in  Polar  Coordinates 


Find  the  equation  of  the  line  y  —  3x  +  2  in  polar  coordinates. 


Solution.  We  merely  substitute:  r  sin  9  =  3rcos  9  +  2,  or  r  =  — — - — - -.  X 

sm  9  —  3  cos  0 


Example  11.4:  Equation  of  a  Circle 


Find  the  equation  of  the  circle  (x  — 1/2 ) 2  +  y2  =  1/4  in  polar  coordinates. 


Solution.  Again  substituting:  (rcos  0  —  1/2)2  +  r2  sin2  9  —  1/4.  A  bit  of  algebra  turns  this  into  r  —  cos (t). 
You  should  try  plotting  a  few  (r,  9)  values  to  convince  yourself  that  this  makes  sense.  X 


Solution.  Here  the  distance  from  the  origin  exactly  matches  the  angle,  so  a  bit  of  thought  makes  it  clear 
that  when  9  >  0  we  get  the  spiral  of  Archimedes  in  figure  1 1.4.  When  9  <  0,  r  is  also  negative,  and  so  the 
full  graph  is  the  right  hand  picture  in  the  figure.  X 


Figure  11.4:  The  spiral  of  Archimedes  and  the  full  graph  of  r  =  9. 


Converting  polar  equations  to  rectangular  equations  can  be  somewhat  trickier,  and  graphing  polar 
equations  directly  is  also  not  always  easy. 


Solution.  Given  that  the  sine  is  periodic,  we  know  that  we  will  get  the  entire  curve  for  values  of  9  in 
[0,27c).  As  9  runs  from  0  to  k/2,  r  increases  from  0  to  2.  Then  as  9  continues  to  k,  r  decreases  again 
to  0.  When  9  runs  from  n  to  2k,  r  is  negative,  and  it  is  not  hard  to  see  that  the  first  part  of  the  curve  is 
simply  traced  out  again,  so  in  fact  we  get  the  whole  curve  for  values  of  9  in  [0,7r).  Thus,  the  curve  looks 
something  like  figure  11.5.  Now,  this  suggests  that  the  curve  could  possibly  be  a  circle,  and  if  it  is,  it 
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would  have  to  be  the  circle  x2  +  (y  —  l)2  =  1.  Having  made  this  guess,  we  can  easily  check  it.  First  we 
substitute  for  x  and  y  to  get  (rc os  0)2  +  (rsinO  —  l)2  =  1;  expanding  and  simplifying  does  indeed  turn  this 
into  r  —  2  sin  0 .  X 


Figure  11.5:  Graph  of  r  —  2sin0. 


Exercises  for  11.1 


Exercise  11.1.1  Plot  these  polar  coordinate  points  on  one  graph:  (2,  tt/3),  (— 3,  7t/2),  (—2,  —  tt/4), 
(1/2,  tt),  ( 1 , 4tt/ 3),  (0,37c/2). 

Exercise  11.1.2  Find  an  equation  in  polar  coordinates  that  has  the  same  graph  as  the  given  equation  in 
rectangular  coordinates. 


(a)  y  —  3x 

(d)  x2  +y2  —  5 

(g)  y 

(b)  y  =  —4 

(e)  y  =  x3 

(h)  x 

(c)  xy2  =  1 

(f)  y  =  sinx 

d)  y 

Exercise  11.1.3  Sketch  the  following 

cur\’es: 

(a)  r  —  cos  9 

(e)  r  -  e/2,  e  >  0 

( b)  r  —  1  +  6l /n2 

(f)  r  —  cot  0  esc  9 

(c)  r  —  sin(0  +  tz/A) 

1 

(g)  r=  ■  n  ,  a 

sin  9  +  cos  9 

(d)  r——sec0 

(h)  r2  —  —2  sec  9  esc  9 

Exercise  11.1.4  Find  an  equation  in  rectangular  coordinates  that  has  the  same  graph  as  the  given  equa¬ 
tion  in  polar  coordinates. 
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(a)  r  =  sin(30) 

(c) 

r  =  sec  0  esc  0 

(b)  r  =  sin* 2  0 

(d) 

r  =  tan  0 

11.2  Slopes  in  Polar  Coordinates 


When  we  describe  a  curve  using  polar  coordinates,  it  is  still  a  curve  in  the  x-y  plane.  We  would  like  to  be 
able  to  compute  slopes  and  areas  for  these  curves  using  polar  coordinates. 

We  have  seen  that  x  —  r  cos  0  and  y  —  r  sin  0  describe  the  relationship  between  polar  and  rectangular 
coordinates.  If  in  turn  we  are  interested  in  a  curve  given  by  r  —  f(0),  then  we  can  write  x  —  f(6)  cos  0  and 
y  —  f(0)  sin0,  describing  x  and  y  in  terms  of  0  alone.  The  first  of  these  equations  describes  0  implicitly 
in  terms  of  x,  so  using  the  chain  rule  we  may  compute 

dy  dy  d  0 
dx  d  0  dx 

Since  dO  /dx  —  1  /(dx/dQ),  we  can  instead  compute 

dy  dy/d6  /(0)  cos  0 +/'(0)  sin0 
dx  dx/dO  — /(0)sin0+/'(0)cos0 


Example  11.7:  Horizontal  Tangent  Line 


Find  the  points  at  which  the  curve  given  by  r  —  1  +  cos  0  has  a  vertical  or  horizontal  tangent  line. 


Solution.  Since  this  function  has  period  2n,  we  may  restrict  our  attention  to  the  interval  [0,  2k)  or  (— k,  k]  , 
as  convenience  dictates.  First,  we  compute  the  slope: 

dy  (1  +cos0)cos0  —  sin  0  sin  0  cos  0  +  cos2  0  —  sin2  0 
dx  —  (1 +cos0)sin0  —  sin0cos0  —  sin0  —  2sin0cos0 

This  fraction  is  zero  when  the  numerator  is  zero  (and  the  denominator  is  not  zero).  The  numerator  is 

r\ 

2  cos  0  +  cos  0  —  1  so  by  the  quadratic  formula 


cos  0 


—  1  ±  \/l  +4  ■  2 
4 


or 


1 

2' 


This  means  0  is  K  or  ±7t/3.  However,  when  0  =  K,  the  denominator  is  also  0,  so  we  cannot  conclude  that 
the  tangent  line  is  horizontal. 

Setting  the  denominator  to  zero  we  get 


-0  —  2  sin  0  cos  0  =  0 

sin0(l +2cos  0)  =  0, 
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so  either  sin0  =  0  or  cos  0  =  —1/2.  The  first  is  true  when  0  is  0  or  k,  the  second  when  9  is  27T/3  or 
4tt/3.  However,  as  above,  when  9  —  K,  the  numerator  is  also  0,  so  we  cannot  conclude  that  the  tangent 
line  is  vertical.  Figure  11.6  shows  points  corresponding  to  9  equal  to  0,  ±1.318,  27t/3  and  4tt/3  on  the 
graph  of  the  function.  Note  that  when  9  —  7t  the  curve  hits  the  origin  and  does  not  have  a  tangent  line. 

X 


Figure  11.6:  Points  of  vertical  and  horizontal  tangency  for  r  =  1  ±  cos  9. 

We  know  that  the  second  derivative  f"{x)  is  useful  in  describing  functions,  namely,  in  describing 
concavity.  We  can  compute  f"(x)  in  terms  of  polar  coordinates  as  well.  We  already  know  how  to  write 
dy/dx  —  y'  in  terms  of  9,  then 

d  dy  dy'  dy' d 9  dyr/d9 
dxdx  dx  d9  dx  dx/d9 


Example  11.8:  Second  Derivative  of  Cardioid 


Find  the  second  derivative  for  the  cardioid  r—  1  ±  cos  9. 


Solution. 

3(l±cos0)  1 

(sin0  ±2sin0cos0)2  —  (sin0  ±2sin0cos0) 

—3(1  ±  cos  0) 

(sin0  ±2sin0cos0)3’ 

The  ellipsis  here  represents  rather  a  substantial  amount  of  algebra.  We  know  from  above  that  the  cardioid 
has  horizontal  tangents  at  ±;r/3;  substituting  these  values  into  the  second  derivative  we  get  y"(K/3)  — 
—  y/3/2  and  y"{—%/ 3)  =  x/3/2,  indicating  concave  down  and  concave  up  respectively.  This  agrees  with 
the  graph  of  the  function.  X 


d  cos  0  ±  cos2  0  —  sin2  0  1 

d9  — sin0  — 2sin0cos0  dx/d9 


Exercises  for  11.2 


Exercise  11.2.1  Compute  y'  =  dy/dx  cindy"  —  d2y/dx2. 
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(a)  r—0 

(b)  r  —  1  +  sin0 

(c)  r  —  cos  0 


(d)  r  —  sin0 

(e)  r  —  secO 

(f)  r  —  sin(20) 


(c)  r  —  -  +  sin0 

2 

(d)  r  =  2  +  cos0 

(e)  r  —  ^  +  cos0 

(f)  r  —  cos(0/2),O  <  0  <  4k 


(i)  r—  1  +cos2(20) 

(j)  r  —  sin2(30) 

(k)  r  =  tan0 

(l)  r  —  sec(0/2),O  <  0  <  An 
(m)  r—  1  +  sec0 


1 

1  —  cos  0 
1 

1  +  sin  0 

(p)  r  =  cot(20) 

(q)  r  —  K/6,0<B<°° 

(r)  r  —  1  +  7r/0,O  <  0  <  °° 


Exercise  11.2.2  Sketch  the  curves  over  the  interval  [0,2tt]  unless  otherwise  stated. 


(a)  r  —  sin  0  +  cos  0 

(g)  r  —  sin(0/3),O  <  0  <  6n 

(n)  r 

( b)  r  —  2  +  2sin0 

(h)  r  —  sin2  0 

(o) 


11.3  Areas  in  Polar  Coordinates 


We  can  use  the  equation  of  a  curve  in  polar  coordinates  to  compute  some  areas  bounded  by  such  curves. 
The  basic  approach  is  the  same  as  with  any  application  of  integration:  Find  an  approximation  that  ap¬ 
proaches  the  true  value.  For  areas  in  rectangular  coordinates,  we  approximated  the  region  using  rectan¬ 
gles;  in  polar  coordinates,  we  use  sectors  of  circles,  as  depicted  in  Figure  11.7.  Recall  that  the  area  of  a 
sector  of  a  circle  is  ar2/ 2,  where  a  is  the  angle  subtended  by  the  sector.  If  the  curve  is  given  by  r  —  f(6), 
and  the  angle  subtended  by  a  small  sector  is  A0,  the  area  is  (A0)(/(0))2/2.  Thus  we  approximate  the 
total  area  as 

nt\m2  A0- 

7=0  Z 


As  a  limit,  this  will  give  rise  to: 

rb  1 

l  p(e)2de. 


Solution. 


1 

-  1  +  2  cos  0  +  cos2  0  cl 6 

2  Jo 


1 

2 


^0  +2sin0  + 


sin  20  \ 

~  J 


2n 


3  K 
~2~' 


o 
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* 

As  in  the  example  above,  if  a  function  is  defined  over  the  complete  region  [0,27t],  then  the  bounds  for 
the  integral  are  a  —  0,  b  —  2k.  However,  sometimes  we  need  to  be  careful! 


Solution.  We  look  for  points  of  intersection. 


0  =  VcosO 
0  =  cos  0 

This  has  solutions  0  —  n/2  and  3k/ 2,  but  we  must  select  our  bounds  for  integration  correctly.  Since  cos  0 
is  negative  on  {n /2,3k/ 3),  a/cos 6  is  undefined.  Note  that  one  full  revolution  is  2k\  that  is,  the  function 
is  2k  periodic.  If  we  were  to  integrate  on  [37t/2,7t/2]  we  would  get  the  same  answer,  and  can  avoid  a 
scenario  in  which  V cos  0  is  undefined.  Equivalently,  we  can  integrate  on  [— k/2,  k/2]. 


Figure  11.7:  Approximating  area  by  sectors  of  circles. 


Example  11.11:  Area  Between  Circles 


Find  the  area  between  the  circles  r  =  2  and  r  =  4  sin  0 ,  as  shown  in  figure  11.8. 


Solution.  The  two  curves  intersect  where  2  =  4sin0,  or  sin  6 
want  is  then 


1  r5*/6  ,  4 

-  /  16sin“0— 4 d0  — - 

2  J jt/6  3 


=  1/2,  so  6  —  k/6  or  5tt/6.  The  area  we 
K  +  2\[2. 


X 
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Figure  11.8:  An  area  between  curves. 

This  first  example  makes  the  process  appear  more  straightforward  than  it  is.  Since  points  have  many 
different  representations  in  polar  coordinates,  it  is  not  always  so  easy  to  identify  points  of  intersection. 


Example  11.12:  Shaded  Area 


Find  the  shaded  area  in  the  first  graph  of  figure  11.9  as  the  difference  of  the  other  two  shaded  areas. 
The  cardioid  is  r  —  1  +  sin  0  and  the  circle  is  r  —  3  sin  0 . 


Solution.  We  attempt  to  find  points  of  intersection: 

1  +  sin  0  —  3  sin  0 
1  =  2sin0 
1/2  =  sin0. 


This  has  solutions  0  —  k/ 6  and  5tt/6;  k/6  corresponds  to  the  intersection  in  the  first  quadrant  that  we 
need.  Note  that  no  solution  of  this  equation  corresponds  to  the  intersection  point  at  the  origin,  but  fortu¬ 
nately  that  one  is  obvious.  The  cardioid  goes  through  the  origin  when  0  —  —  7r/2;  the  circle  goes  through 
the  origin  at  multiples  of  j r,  starting  with  0. 

Now  the  larger  region  has  area 


1  r71 /6 

-  (1  +  sin0)2  dO 

2  J— jt/2 


and  the  smaller  has  area 


1  r71/6 

-  /  (3sin0)2  dQ 

2  Jo 


so  the  difference  is  the  area  we  seek,  which  is  tt/8. 


3  K 

T 


X 
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Figure  11.9:  An  area  between  curves. 


Exercises  for  11.3 


Exercise  11.3.1  Find  the  area  enclosed  by  the  cur\’e. 

(a)  r  —  \/sin0 

(b)  r  —  2  +  cos0 

(c)  r  —  sec0,7r/6  <  0  <  7t/3 


(d)  r  =  cos0,O  <  0  <  n/3 

(e)  r  —  2acos0,a  >  0 

(f)  r  —  4  +  3  sin  0 


Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 


11.3.2  Find  the  area  inside  the  loop  formed  by  r  —  tan  (0/2). 

11.3.3  Find  the  area  inside  one  loop  ofr  —  cos(30). 

11.3.4  Find  the  area  inside  one  loop  ofr  —  sin-  0. 

11.3.5  Find  the  area  inside  the  small  loop  ofr  —  (1/2)  +  cos  0. 

11.3.6  Find  the  area  inside  r  —  (1/2)  +  cos  0,  including  the  area  inside  the  small  loop. 

11.3.7  Find  the  area  inside  one  loop  ofr 2  —  cos(20). 

esc  0 

11.3.8  Find  the  area  enclosed  by  r  =  tan  0  and  r  =  — — . 

V2 

11.3.9  Find  the  area  inside  r  —  2  cos  0  and  outside  r  =  1. 

11.3.10  Find  the  area  inside  r  —  2  sin  0  and  above  the  line  r  —  (3/2)  esc  0. 

11.3.11  Find  the  area  inside  r  —  6,  0  <  0  <  2k. 
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Exercise  11.3.12  Find  the  area  inside  r  =  Vo,  0  <  9  <  2k. 

Exercise  11.3.13  Find  the  area  inside  both  r  —  v^3  cos  9  and  r  —  sin  9. 

Exercise  11.3.14  Find  the  area  inside  both  r  =  1  —  cos  9  and  r  —  cos  9. 

Exercise  11.3.15  The  center  of  a  circle  of  radius  1  is  on  the  circumference  of  a  circle  of  radius  2.  Find 

the  area  of  the  region  inside  both  circles. 

Exercise  11.3.16  Find  the  shaded  area  in  figure  11.10.  The  cur\>e  is  r  —  9,  0  <  9  <  3k. 


Figure  11.10:  An  area  bounded  by  the  spiral  of  Archimedes. 


11.4  Parametric  Equations 


When  we  computed  the  derivative  dy/dx  using  polar  coordinates,  we  used  the  expressions  x  —  f{9)  cos  9 
andy  —  f(9)  sin9.  These  two  equations  completely  specify  the  curve,  though  the  form  r  —  f(9)  is  simpler. 
The  expanded  form  has  the  virtue  that  it  can  easily  be  generalized  to  describe  a  wider  range  of  curves  than 
can  be  specified  in  rectangular  or  polar  coordinates. 

Suppose  f(t)  and  g(t)  are  functions.  Then  the  equations  x  =  f(l)  and  y  =  g(t)  describe  a  curve  in  the 
plane.  In  the  case  of  the  polar  coordinates  equations,  the  variable  t  is  replaced  by  9  which  has  a  natural 
geometric  interpretation.  In  general,  t  is  simply  an  arbitrary  variable,  often  called  in  this  case  a  parameter, 
and  this  method  of  specifying  a  curve  is  known  as  parametric  equations.  One  important  interpretation  of 
t  is  time.  In  this  interpretation,  the  equations  x  —  f(t )  and  y  =  g(t)  give  the  position  of  an  object  at  time  t. 


Example  11.13:  Position  of  a  Path 


Describe  the  path  of  an  object  that  moves  so  that  its  position  at  time  t  is  given  byx  —  cos  t,  y  —  cos2 1. 
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Solution.  We  see  immediately  that  y  —  x  ,  so  the  path  lies  on  this  parabola.  The  path  is  not  the  entire 
parabola,  however,  since  x  —  cos  t  is  always  between  —1  and  1.  It  is  now  easy  to  see  that  the  object 
oscillates  back  and  forth  on  the  parabola  between  the  endpoints  (1,1)  and  (  —  1,1),  and  is  at  point  (1,1)  at 
time  t  =  0.  £ 

It  is  sometimes  quite  easy  to  describe  a  complicated  path  in  parametric  equations  when  rectangular 
and  polar  coordinate  expressions  are  difficult  or  impossible  to  devise. 


Example  11.14:  Wheel 


A  wheel  of  radius  1  rolls  along  a  straight  line,  say  the  x-axis.  A  point  on  the  rim  of  the  wheel  will 
trace  out  a  curve,  called  a  cycloid.  Assume  the  point  starts  at  the  origin;  find  parametric  equations 
for  the  curve. 


Solution.  Figure  11.11  illustrates  the  generation  of  the  curve.  The  wheel  is  shown  at  its  starting  point,  and 
again  after  it  has  rolled  through  about  490  degrees.  We  take  as  our  parameter  t  the  angle  through  which 
the  wheel  has  turned,  measured  as  shown  clockwise  from  the  line  connecting  the  center  of  the  wheel  to 
the  ground.  Since  the  radius  is  1,  the  center  of  the  wheel  has  coordinates  (t,  1).  We  seek  to  write  the 
coordinates  of  the  point  on  the  rim  as  (t  +  At,  1  +  Ay),  where  Ax  and  Ay  are  as  shown  in  figure  11.12. 
These  values  are  nearly  the  sine  and  cosine  of  the  angle  t,  from  the  unit  circle  definition  of  sine  and  cosine. 
However,  some  care  is  required  because  we  are  measuring  t  from  a  nonstandard  starting  line  and  in  a 
clockwise  direction,  as  opposed  to  the  usual  counterclockwise  direction.  A  bit  of  thought  reveals  that 
At  =  —  sint  and  Ay  =  —  cost.  Thus  the  parametric  equations  for  the  cycloid  are  x  =  t  —  sint,  y  =  1  —  cost. 

* 


Figure  11.11:  A  cycloid. 


Figure  11.12:  The  wheel. 
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Exercises  for  11.4 


•  9  A 

Exercise  11.4.1  What  curve  is  described  by  x  —  t  ,  y  —  t  ?  If  t  is  interpreted  as  time,  describe  how  the 
object  moves  on  the  curve. 

Exercise  11.4.2  What  curve  is  described  by  x  =  3 cos t,  y  =  3 sin/ ?  Ift  is  interpreted  as  time,  describe 
how  the  object  moves  on  the  curve. 

Exercise  11.4.3  What  cur\’e  is  described  by  x  —  3  cos  t,  y  =  2sin/  ?  Ift  is  interpreted  as  time,  describe 
how  the  object  moves  on  the  cur\>e. 

Exercise  11.4.4  What  curve  is  described  by  x  =  3sin  t,  y  —  3  cos  t?  Ift  is  interpreted  as  time,  describe 
how  the  object  moves  on  the  curve. 

Exercise  11.4.5  Sketch  the  cun>e  described  by  x  —  t  —t,y  —  t.Iftis  interpreted  as  time,  describe  how 
the  object  moves  on  the  curve. 

Exercise  11.4.6  A  wheel  of  radius  1  rolls  along  a  straight  line,  say  the  x-axis.  A  point  P  is  located  halfway 
between  the  center  of  the  wheel  and  the  rim;  assume  P  starts  at  the  point  (0, 1/2).  As  the  wheel  rolls,  P 
traces  a  curve;  find  parametric  equations  for  the  curve. 


11.5  Calculus  with  Parametric  Equations 


We  have  already  seen  how  to  compute  slopes  of  curves  given  by  parametric  equations — it  is  how  we 
computed  slopes  in  polar  coordinates. 


Solution.  We  compute  x!  =  1  —  cost,  /  =  sin/,  so 

dy  sin  t 
dx  1  —  cos  t 

Note  that  when  t  is  an  odd  multiple  of  n,  like  n  or  3 n,  this  is  (0/2)  =  0,  so  there  is  a  horizontal  tangent 
line,  in  agreement  with  figure  11.11.  At  even  multiples  of  71,  the  fraction  is  0/0,  which  is  undefined.  The 
figure  shows  that  there  is  no  tangent  line  at  such  points.  £ 

Areas  can  be  a  bit  trickier  with  parametric  equations,  depending  on  the  curve  and  the  area  desired.  We 
can  potentially  compute  areas  between  the  curve  and  the  x-axis  quite  easily. 
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Example  11.16:  Area  Under  Cycloid  Arch 


Find  the  area  under  one  arch  of  the  cycloid  x  —  t  —  sint,  y  =  1  —  cost. 


Solution.  We  would  like  to  compute 

rlTZ 

/  y  dx, 

Jo 

but  we  do  not  know  y  in  terms  of  x.  However,  the  parametric  equations  allow  us  to  make  a  substitution: 
use  y  =  1  —  cost  to  replace  y,  and  compute  dx  —  (1—  cost)  dt.  Then  the  integral  becomes 


cost)  dt 


3  k. 


Note  that  we  need  to  convert  the  original  x  limits  to  t  limits  using  x  —  t  —  sinf.  When  x  —  0,  t  —  sint, 
which  happens  only  when  t  =  0.  Likewise,  when  x  =  2k,  t  —  2k  —  sint  and  t  =  2k.  Alternately,  because 
we  understand  how  the  cycloid  is  produced,  we  can  see  directly  that  one  arch  is  generated  by  0  <  t  <  2k. 
In  general,  of  course,  the  t  limits  will  be  different  than  the  x  limits.  X 

This  technique  will  allow  us  to  compute  some  quite  interesting  areas,  as  illustrated  by  the  exercises. 

As  a  final  example,  we  see  how  to  compute  the  length  of  a  curve  given  by  parametric  equations.  The 
arc  length  for  functions  given  as  y  in  terms  of  x  is  the  formula: 


Using  some  properties  of  derivatives,  including  the  chain  rule,  we  can  convert  this  to  use  parametric 
equations  x  =  /(t),  y  —  g(t ): 


=  /"  v/(/'W)2  +  (s'(<))2  *• 

Ju  y 


Here  u  and  v  are  the  t  limits  corresponding  to  the  x  limits  a  and  h. 


Solution.  From  x 
is 


=  t  —  sint,  y  —  1  —  cost,  we  get  the  derivatives  /'  =  !.  —  cost  and  g' 


—  cost)2  +  sin2 1  dt 


\/2  —  2cost  dt. 


sint,  so  the  length 
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Now  we  use  the  formula  sin~(t/2)  =  (1  —  cos(f))/2  or  4sin~(t/2)  =  2  —  2cos t  to  get 

J  y//4sin2(t/2)  dt. 

Since  0  <  t  <  2 Jt,  sin(t/2)  >  0,  so  we  can  rewrite  this  as 

c2k 
10 


r2  n 

/  2sin(t/2)  dt  —  8. 

Jo 


X 


Exercises  for  11.5 


Exercise  11.5.1  Consider  the  curve  of  11.4.6  in  section  11.4.  Find  all  values  of  t  for  which  the  cur\’e  has 
a  horizontal  tangent  line. 

Exercise  11.5.2  Consider  the  curx’e  of  11.4.6  in  section  11.4.  Find  the  area  under  one  arch  of  the  curve. 

Exercise  11.5.3  Consider  the  cur\>e  of  11.4.6  in  section  11.4.  Set  up  an  integral  for  the  length  of  one  arch 
of  the  curx’e. 


11.6  Conics  in  Polar  Coordinates 


A  conic  section  is  a  curve  obtained  as  the  intersection  of  a  cone  and  a  plane.  One  useful  geometric 
definition  that  only  involves  the  plane  is  that  a  conic  consists  of  those  points  whose  distances  to  some 
point,  called  a  focus,  and  some  line,  called  a  directrix,  are  in  a  fixed  ratio,  called  the  eccentricity. 

The  three  types  of  conic  sections  are  the  ellipse,  parabola  and  hyperbola  (with  the  circle  being  a 
degenerate  case  of  an  ellipse). 
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Let  F  be  a  fixed  point  (the  focus),  L  be  a  line  (the  directrix)  not  containing  F  and  e  be  a  nonnegative 
real  number  (the  eccentricity).  The  conics  sections  are  obtained  by  the  set  of  all  points  P  whose  distance 
to  F  equals  e  times  their  distance  to  L,  that  is: 

\PF\ 


In  the  case  that: 


•  c<  1  we  obtain  an  ellipse  (and  when  e  =  0we  obtain  the  degenerate  case:  A  circle), 

•  e  —  1  we  obtain  a  parabola, 

•  e  >  1  we  obtain  a  hyperbola. 

To  obtain  a  simple  polar  equation  we  place  the  focal  point  at  the  origin.  The  formulation  for  a  conic 
section  is  then  given  in  the  polar  form  by 

pe  pe 

r  = -  and  r  — - 

1  ±  ecos  0  l±esin0 

where  e  is  the  eccentricity  and  p  is  the  focal  parameter  representing  the  distance  from  the  focus  (or  one  of 
the  two  foci)  to  the  directrix. 

The  three  different  types  of  conic  sections  are  shown  below.  Focal  points  corresponding  to  all  conic 
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sections  are  placed  at  the  origin.  First  is  the  parabola. 


Next  is  the  ellipse. 
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Finally,  we  have  the  hyperbola. 


Some  things  to  keep  in  mind  with  respect  to  the  denominator  of  polar  equations  of  various  conics  are 
the  following: 

•  If  the  denominator  is  1  +  esin 0,  it  has  a  horizontal  directrix  above  the  focal  point. 

•  If  the  denominator  is  1  —  esin 0,  it  has  a  horizontal  directrix  below  the  focal  point. 

•  If  the  denominator  is  1  +  ecos  0,  it  has  a  vertical  directrix  to  the  right  of  the  focal  point. 

•  If  the  denominator  is  1  —  ecos  0,  it  has  a  vertical  directrix  to  the  left  of  the  focal  point. 


Example  11.18:  Polar  Equations  for  a  Parabola 


Find  the  equation  of  a  parabola  with  focus  at  the  origin  and  whose  directrix  is  the  line  x  —  —  1. 


Solution.  Since  we  have  a  parabola,  e  =  1.  Furthermore,  p  —  1.  Since  the  graph  has  a  vertical  directrix, 
the  equation  will  use  1  —  ecos  0  in  the  denominator.  Thus,  the  equation  is: 

2 


Exercises  for  11.6 


Exercise  11.6.1  Identify  the  following  conics  and  find  the  eccentricity. 
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(a)  r 

(b)  r 


2 

1  +  sin  0 

4 

2  +  cos0 


(c)  r 


(d)  r 


3 

1  —  sin  0 

5 

2  +  2sin0 


Exercise  11.6.2  Write  the  polar  equation  of  a  parabola  with  focus  at  the  origin  and  directrix  x  —  3. 

Exercise  11.6.3  Write  the  polar  equation  of  a  hyperbola  with  focus  at  the  origin,  directrix  x  —  4  and 
eccentricity  2. 

Exercise  11.6.4  Write  the  polar  equation  of  an  ellipse  with  focus  at  the  origin,  directrix  x  —  4sec  0  and 
eccentricity  1/2. 


12.  Three  Dimensions 


12.1  The  Coordinate  System 


Throughout  the  text  thus  far  we  have  focused  investigating  functions  of  the  form  y  —  f(x),  with  one 
independent  and  one  dependent  variable.  Such  functions  can  be  represented  in  two  dimensions,  using  two 
numerical  axes  that  allow  us  to  identify  every  point  in  the  plane  with  two  numbers.  We  now  shift  our  focus 
to  three-dimensional  space;  to  identify  every  point  in  three  dimensions  we  require  three  numerical  values. 
The  obvious  way  to  make  this  association  is  to  add  one  new  axis,  perpendicular  to  the  x  and  y  axes  we 
already  understand.  We  could,  for  example,  add  a  third  axis,  the  z  axis,  with  the  positive  z  axis  coming 
straight  out  of  the  page,  and  the  negative  z  axis  going  out  the  back  of  the  page.  This  is  difficult  to  work 
with  on  a  printed  page,  so  more  often  we  draw  a  view  of  the  three  axes  from  an  angle: 


y 


You  must  then  imagine  that  the  z  axis  is  perpendicular  to  the  other  two.  Just  as  we  have  investigated 
functions  of  the  form  y  =  f(x)  in  two  dimensions,  we  will  investigate  three  dimensions  largely  by  consid¬ 
ering  functions;  now  the  functions  will  (typically)  have  the  form  z  —  f(x,y).  Due  to  the  fact  that  we  are 
used  to  having  the  result  of  a  function  graphed  in  the  vertical  direction,  it  is  somewhat  easier  to  maintain 
that  convention  in  three  dimensions.  To  accomplish  this,  we  normally  rotate  the  axes  so  that  z  points  up; 
the  result  is  then: 
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Note  that  if  you  imagine  looking  down  from  above,  along  the  z  axis,  the  positive  z  axis  will  come 
straight  toward  you,  the  positive  y  axis  will  point  up,  and  the  positive  x  axis  will  point  to  your  right,  as 
usual.  Any  point  in  space  is  identified  by  providing  the  three  coordinates  of  the  point,  as  shown;  naturally, 
we  list  the  coordinates  in  the  order  (jc,y,z).  One  useful  way  to  think  of  this  is  to  use  the  x  and  y  coordinates 
to  identify  a  point  in  the  x-y  plane,  then  move  straight  up  (or  down)  a  distance  given  by  the  z  coordinate. 

It  is  now  fairly  simple  to  understand  some  “shapes”  in  three  dimensions  that  correspond  to  simple 
conditions  on  the  coordinates.  In  two  dimensions  the  equation  x  =  1  describes  the  vertical  line  through 
(1,0).  In  three  dimensions,  it  still  describes  all  points  with  x-coordinate  1,  but  this  is  now  a  plane,  as  in 
Figure  12.1. 


Figure  12.1:  The  plane  x  =  1. 

Recall  the  very  useful  distance  formula  in  two  dimensions  which  comes  directly  from  the  Pythagorean 
Theorem:  the  distance  between  points  (xi,yi)  and  (x2,y2)  is  \J (xi  —  X2)2  +  (yi  — y2)2-  What  is  the  dis¬ 
tance  between  two  points  (xi,yi,zi)  and  {x2,yi,zi)  in  three  dimensions?  Geometrically,  we  want  the  length 
of  the  long  diagonal  labelled  c  in  the  “box”  in  Figure  12.2.  Since  a,  b,  c  form  a  right  triangle,  a2  +  b2  —  c2. 
b  is  the  vertical  distance  between  (xi,yi,zi)  and  (*2, yi,Z2),  so  b  =  |zi  —  Z2I -  The  length  a  runs  parallel  to 
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the  x-y  plane,  so  it  is  simply  the  distance  between  (jci,  vi )  and  (x2,y2),  that  is,  a 2  =  (ay  —  X2)2  +  (y  1  —  y2)2- 
Now  we  see  that  c 2  =  (xy  -  x2)2  +  (yi  -V2)2  +  (zi  -  Z2)2  and  c  =  ^ (ay  -x2)2  +  (yi  -V2)2  +  (zi  ~z2)2. 

It  is  sometimes  useful  to  give  names  to  points,  for  example  we  might  let  Pi  =  (ay,yi,zi),  or  more 
concisely  we  might  refer  to  the  point  Pi(xy,yi,Zi),  and  subsequently  use  just  P\.  Distance  between  two 
points  in  either  two  or  three  dimensions  is  sometimes  denoted  by  d,  so  for  example  the  formula  for  the 
distance  between  P\  (xi,yy,zi)  and  P2(x2,y2,z2)  might  be  expressed  as 

d(Pi,P2)  =  \j (ay  -x2)2  +  (yi -y2)2  +  (zi  ~z2)2. 


z 


Figure  12.2:  Distance  in  three  dimensions. 

In  two  dimensions,  the  distance  formula  immediately  gives  us  the  equation  of  a  circle:  the  circle 
of  radius  r  and  center  at  (h,k)  consists  of  all  points  (x,y)  at  distance  r  from  (h,k),  so  the  equation  is 
r  —  \J (x  —  h)2  +  (y  —  k)2  or  r2  —  (x  — h)2  +  (y  —  k)2.  Now  we  can  get  the  similar  equation  r2  —  (x  —  h)2  + 
(y  —  k)2  +  (z  —  l)2,  which  describes  all  points  (x,y,z)  at  distance  r  from  (h,k,  /),  namely,  the  sphere  with 
radius  r  and  center  ( h,k,l ). 
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Exercises  for  12.1 


Exercise  12.1.1  Sketch  the  location  of  the  points  (1,1,0),  (2, 3,  —  1 ),  and  ( —  1 , 2, 3)  on  a  single  set  of  axes. 

Exercise  12.1.2  Describe  geometrically  the  set  of  points  (. x,y,z )  that  satisfy  z  =  4. 

Exercise  12.1.3  Describe  geometrically  the  set  of  points  (x,y,z)  that  satisfy  y  —  —3. 

Exercise  12.1.4  Describe  geometrically  the  set  of  points  (x,y,z)  that  satisfy  x  +  y  =  2. 

Exercise  12.1.5  The  equation  x+y +  z  —  1  describes  some  collection  of  points  in  R  .  Describe  and  sketch 

the  points  that  satisfy  x+y+z—  1  and  are  in  the  x-y  plane,  in  the  x-z  plane,  and  in  the  y-z  plane. 

Exercise  12.1.6  Find  the  lengths  of  the  sides  of  the  triangle  with  vertices  (1,0, 1),  (2,2,  —  1),  and  (—3,2,  —2). 


Exercise  12.1.7  Find  the  lengths  of  the  sides  of  the  triangle  with  vertices  (2,2,3),  (8,6,5),  and  (  —  1,0,2). 
Why  do  the  results  tell  you  that  this  isn’t  really  a  triangle? 

Exercise  12.1.8  Find  an  equation  of  the  sphere  with  center  at  { 1,1,1)  and  radius  2. 

Exercise  12.1.9  Find  an  equation  of  the  sphere  with  center  at  (2,  —  1, 3)  and  radius  5. 

Exercise  12.1.10  Find  an  equation  of  the  sphere  with  center  (3,-2, 1)  and  that  goes  through  the  point 
(4,2,5). 

Exercise  12.1.11  Find  an  equation  of  the  sphere  with  center  at  (2, 1,-1)  and  radius  4.  Find  an  equation 
for  the  intersection  of  this  sphere  with  the  y-z  plane;  describe  this  intersection  geometrically. 

Exercise  12.1.12  Consider  the  sphere  of  radius  5  centered  at  (2,3,4).  What  is  the  intersection  of  this 
sphere  with  each  of  the  coordinate  planes? 

Exercise  12.1.13  Show  that  for  all  values  of  6  and  (j),  the  point  (a  sin  (j)  cos  0,  a  sin  (j)  sin  9,  a  cos  <j>)  lies  on 
the  sphere  given  by  x2  +  y2  +  z2  —  a2. 

Exercise  12.1.14  Prove  that  the  midpoint  of  the  line  segment  connecting  (*i,;yi,zi)  to  (x2,y2,Zi)  is  at 
( *i  +-*2  y\+yi  zi  +Z2\ 

V  2  ’  2  ’  2  )' 

Exercise  12.1.15  Any  three  points  P\(x\,y\,z\),  PiU'i+’i+.i),  If  U'3 , >’3 ,  Z3 ) ,  lie  in  a  plane  and  form  a 
triangle.  The  triangle  inequality  says  that  d(P\,Pf)  <  d(P\,Pf)  +  d(P2,Pf).  Prove  the  triangle  inequality 
using  either  algebra  (messy)  or  the  law  of  cosines  (less  messy). 

Exercise  12.1.16  Is  it  possible  for  a  plane  to  intersect  a  sphere  in  exactly  two  points?  Exactly  one  point? 
Explain. 
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12.2  Vectors 


A  vector,  denoted  v,  is  a  quantity  consisting  of  a  non-negative  magnitude  and  a  direction.  We  could 
represent  a  vector  in  two  dimensions  as  (m,  0),  where  m  is  the  magnitude  and  0  is  the  direction,  measured 
as  an  angle  from  some  agreed  upon  direction.  For  example,  we  might  think  of  the  vector  (5,45°)  as 
representing  “5  km  toward  the  northeast”;  that  is,  this  vector  might  be  a  displacement  vector,  indicating, 
say,  that  your  grandfather  walked  5  kilometers  toward  the  northeast  to  school  in  the  snow.  On  the  other 
hand,  the  same  vector  could  represent  a  velocity,  indicating  that  your  grandfather  walked  at  5  km/hr 
toward  the  northeast.  What  the  vector  does  not  indicate  is  where  this  walk  occurred:  a  vector  represents  a 
magnitude  and  a  direction,  but  not  a  location.  Pictorially  it  is  useful  to  represent  a  vector  as  an  arrow;  the 
direction  of  the  vector,  naturally,  is  the  direction  in  which  the  arrow  points;  the  magnitude  of  the  vector  is 
reflected  in  the  length  of  the  arrow. 

It  turns  out  that  many,  many  quantities  behave  as  vectors,  e.g.,  displacement,  velocity,  acceleration, 
force.  Already  we  can  get  some  idea  of  their  usefulness  using  displacement  vectors.  Suppose  that  your 
grandfather  walked  5  km  NE  and  then  2  km  SSE;  if  the  terrain  allows,  and  perhaps  armed  with  a  compass, 
how  could  your  grandfather  have  walked  directly  to  his  destination?  We  can  use  vectors  (and  a  bit  of 
geometry)  to  answer  this  question.  We  begin  by  noting  that  since  vectors  do  not  include  a  specification  of 
position,  we  can  “place”  them  anywhere  that  is  convenient.  So  we  can  picture  your  grandfather’s  journey 
as  two  displacement  vectors  drawn  head  to  tail: 


The  displacement  vector  for  the  shortcut  route  is  the  vector  drawn  with  a  dashed  line,  from  the  tail  of 
the  first  to  the  head  of  the  second.  With  a  little  trigonometry,  we  can  compute  that  the  third  vector  has 
magnitude  approximately  4.62  and  direction  21.43°,  so  walking  4.62  km  in  the  direction  21.43°  north  of 
east  (approximately  ENE)  would  get  your  grandfather  to  school.  This  sort  of  calculation  is  so  common, 
we  dignify  it  with  a  name:  we  say  that  the  third  vector  is  the  sum  of  the  other  two  vectors. 

There  is  another  common  way  to  picture  the  sum  of  two  vectors.  Put  the  vectors  tail  to  tail  and  then 
complete  the  parallelogram  they  indicate;  the  sum  of  the  two  vectors  is  the  diagonal  of  the  parallelogram: 


This  is  a  more  natural  representation  in  some  circumstances.  For  example,  if  the  two  original  vectors 
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represent  forces  acting  on  an  object,  the  sum  of  the  two  vectors  is  the  net  or  effective  force  on  the  object, 
and  it  is  convenient  to  draw  all  three  with  their  tails  at  the  location  of  the  object. 

We  also  define  scalar  multiplication  for  vectors:  if  v  is  a  vector  (m,  0)  and  a  >0  is  a  real  number, 
the  vector  ay  is  (am,  0),  namely,  it  points  in  the  same  direction  but  has  a  times  the  magnitude.  If  a  <  0, 
ay  is  (\a\m,  0  +  n),  with  \a\  times  the  magnitude  and  pointing  in  the  opposite  direction  (unless  we  specify 
otherwise,  angles  are  measured  in  radians). 

Now  we  can  understand  subtraction  of  vectors:  v  —  w  =  v  +  (— l)w: 


w 


w 


Note  that  as  you  would  expect,  w  +  (v  —  w)  =  v. 

We  can  represent  a  vector  in  ways  other  than  (m,  0),  and  in  fact  (m,  0)  is  not  generally  used  at  all. 
How  else  could  we  describe  a  particular  vector?  Consider  again  the  vector  (5,45°).  Let’s  draw  it  again, 
but  impose  a  coordinate  system.  If  we  put  the  tail  of  the  arrow  at  the  origin,  the  head  of  the  arrow  ends  up 
at  the  point  (5/y/2,5/V2)  ~  (3.54,3.54). 


(3.54,3.54) 


In  this  picture  the  coordinates  (3.54,3.54)  identify  the  head  of  the  arrow,  provided  we  know  that  the 
tail  of  the  arrow  has  been  placed  at  (0,0).  Then  in  fact  the  vector  can  always  be  identified  as  (3.54,3.54), 
no  matter  where  it  is  placed;  we  just  have  to  remember  that  the  numbers  3.54  must  be  interpreted  as  a 
change  from  the  position  of  the  tail,  not  as  the  actual  coordinates  of  the  arrow  head;  to  emphasize  this  we 
will  write  (3.54,3.54)  to  mean  the  vector  and  (3.54,3.54)  to  mean  the  point.  Then  if  the  vector  (3.54,3.54) 
is  drawn  with  its  tail  at  (1,2)  it  looks  like  this: 


(4.54,5.54) 


3.54 
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Consider  again  the  two  part  trip:  5  km  NE  and  then  2  km  SSE.  The  vector  representing  the  first  part  of 
the  trip  is  (5/V2,5/V2),  and  the  second  part  of  the  trip  is  represented  by  (2cos(— 37r/8),2sin(  — 3tt/8))  ~ 
(0.77,-1.85).  We  can  represent  the  sum  of  these  with  the  usual  head  to  tail  picture: 


It  is  clear  from  the  picture  that  the  coordinates  of  the  destination  point  are  ( 5 /  \/l  +  2  cos  ( — 3  tt/  8) ,  5  /  \fl  + 
2sin(— 3tt/8))  or  approximately  (4.3, 1.69),  so  the  sum  of  the  two  vectors  is  (5/v/2  +  2cos(— 3^/8),5/v/2  + 
2sin(— 3tt/8) )  ~  (4.3, 1.69).  Adding  the  two  vectors  is  easier  in  this  form  than  in  the  (m,  0)  form,  provided 
that  we’re  willing  to  have  the  answer  in  this  form  as  well. 

It  is  easy  to  see  that  scalar  multiplication  and  vector  subtraction  are  also  easy  to  compute  in  this  form: 
a(v,w)  —  ( av,aw )  and  (v\,w\)  —  (y2,W2)  —  (vi  —  V2,wi  —  W2).  What  about  the  magnitude?  The  magnitude 
of  the  vector  (v,w)  is  still  the  length  of  the  corresponding  arrow  representation;  this  is  the  distance  from 
the  origin  to  the  point  (v,w),  namely,  the  distance  from  the  tail  to  the  head  of  the  arrow.  Using  the  familiar 
distance  formula  the  magnitude  of  the  vector  is  simply  \J v2  +  w2 ,  which  we  also  denote  with  absolute 
value  bars:  |(v,w)|  =  \J v2  +  w2. 

In  three  dimensions,  vectors  are  still  quantities  consisting  of  a  magnitude  and  a  direction,  but  of  course 
there  are  many  more  possible  directions.  It’s  not  clear  how  we  might  represent  the  direction  explicitly, 
but  the  coordinate  version  of  vectors  makes  just  as  much  sense  in  three  dimensions  as  in  two.  By  (1,2,3) 
we  mean  the  vector  whose  head  is  at  (1,2,3)  if  its  tail  is  at  the  origin.  As  before,  we  can  place  the  vector 
anywhere  we  want;  if  it  has  its  tail  at  (4,5,6)  then  its  head  is  at  (5,7,9).  It  remains  true  that  arithmetic  is 
easy  to  do  with  vectors  in  this  form. 
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Figure  12.3:  The  vector  (2,4,5)  with  its  tail  at  the  origin. 

Three  particularly  simple  vectors  turn  out  to  be  quite  useful:  i  =  (1,0,0),  j  =  (0, 1,0),  and  k  =  (0,0, 1). 
These  play  much  the  same  role  for  vectors  that  the  axes  play  for  points.  In  particular,  notice  that 

(vi,v2,v3)  =  (vi,0,0)  +  (0,v2,0)  +  (0,0, v3) 

=  vi(l,0,0)  + v2(0, 1,0)  +v3(0,0, 1) 

=  vii  +  v2j  +  v3k 

Thus  far,  we  have  focused  our  discussion  on  vectors  which  begin  at  the  origin  and  end  at  a  point. 
However  we  will  frequently  want  to  produce  a  vector  that  points  from  one  point  to  another.  That  is,  if  P 
and  Q  are  points,  we  seek  the  vector  x  such  that  when  the  tail  of  x  is  placed  at  P,  its  head  is  at  Q\  we  refer 

to  this  vector  as  PQ.  If  we  know  the  coordinates  of  P  and  Q,  the  coordinates  of  the  vector  are  easy  to  find. 


Exercises  for  12.2 


Exercise  12.2.1  Draw  the  vector  (3,-1)  with  its  tail  at  the  origin. 


Exercise  12.2.2  Draw  the  vector  (3,  —  1,2)  with  its  tail  at  the  origin. 

Exercise  12.2.3  Let  v  be  the  vector  with  tail  at  the  origin  and  head  at  (1,2);  let  w  be  the  vector  with  tail 
at  the  origin  and  head  at  (3, 1).  Draw  v  and  w  and  a  vector  u  with  tail  at  (1,2)  and  head  at  (3, 1).  Draw 
u  with  its  tail  at  the  origin. 
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Exercise  12.2.4  Let  v  be  the  vector  with  tail  at  the  origin  and  head  at  (—1,2);  let  w  be  the  vector  with 
tail  at  the  origin  and  head  at  (3,3).  Draw  v  and  w  and  a  vector  u  with  tail  at  (  —  1,2)  and  head  at  (3,3). 
Draw  u  with  its  tail  at  the  origin. 

Exercise  12.2.5  Let  v  be  the  vector  with  tail  at  the  origin  and  head  at  (5,2);  let  w  be  the  vector  with  tail 
at  the  origin  and  head  at  (1,5).  Draw  v  and  w  and  a  vector  u  with  tail  at  (5,2)  and  head  at  (1,5).  Draw 
u  with  its  tail  at  the  origin. 

Exercise  12.2.6  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w\  and  —Ivforv  =  (1,3)  and w  =  (-1,-5). 

Exercise  12.2.7  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w\  and  —Ivforv  —  (1,2,3)  and w  —  (—1,2, —3). 

Exercise  12.2.8  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w»|  and  —Ivforv  =  (1,0,1)  and  w  —  (-1,-2, 2). 

Exercise  12.2.9  Find  \v\,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w\  and  —2v  for  v  —  (1,  —  1, 1)  and  w  —  (0,0,3). 

Exercise  12.2.10  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w\  and  —Ivforv  —  (3,2,1)  andw  —  (— 1,  — 1,  — 1). 

Exercise  12.2.11  Let  P  =  (4,5,6),  Q  —  (1,2,  —5).  Find  PQ.  Find  a  vector  with  the  same  direction  as  PQ 
but  with  length  1.  Find  a  vector  with  the  same  direction  as  PQ  but  with  length  4. 


Exercise  12.2.12  IfA,B,  and  C  are  three  points,  find  AB  +  BC  +  CA. 

Exercise  12.2.13  Consider  the  12  vectors  that  have  their  tails  at  the  center  of  a  clock  and  their  respective 
heads  at  each  of  the  12  digits.  What  is  the  sum  of  these  vectors?  What  if  we  remove  the  vector  corre¬ 
sponding  to  4  o’clock?  What  if,  instead,  all  vectors  have  their  tails  at  12  o’clock,  and  their  heads  on  the 
remaining  digits? 

Exercise  12.2.14  Let  a  and  b  be  nonzero  vectors  in  two  dimensions  that  are  not  parallel  or  anti-parallel. 
Show,  algebraically,  that  ifc  is  any  two  dimensional  vector,  there  are  scalars  s  and  t  such  that  c  —  sa  +  tb. 

Exercise  12.2.15  Does  the  statement  in  the  previous  exercise  hold  if  the  vectors  a,  b,  and  c  are  three 
dimensional  vectors?  Explain. 


12.3  The  Dot  Product 


The  goal  of  this  section  is  to  answer  the  following  question.  Given  two  vectors,  what  is  the  angle  between 
them? 

Since  vectors  have  no  position,  we  are  free  to  place  vectors  wherever  we  like.  If  the  two  vectors  are 
placed  tail-to-tail,  there  is  now  a  reasonable  interpretation  of  the  question:  we  seek  the  measure  of  the 
smallest  angle  between  the  two  vectors,  in  the  plane  in  which  they  lie.  Figure  12.4  illustrates  the  situation. 
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Figure  12.4:  The  angle  between  vectors  v  and  w. 

Since  the  angle  9  lies  in  a  triangle,  we  can  compute  it  using  a  bit  of  trigonometry,  namely,  the  law  of 
cosines.  Remember  that  the  law  of  cosines  states  c2  —  a2  +  b2  —  2abcosC. 

The  lengths  of  the  sides  of  the  triangle  in  Figure  12.4  are  |v|,  |w|,  and  |v  —  w|.  Let  v  =  (vi,V2,V3)  and 
w  =  {w\,W2,W'i)\  then 

| v  —  w|2  =  | v | 2  — F  |  w | 2  —  2|v||w|cos0 
2|v||w|  cost?  =  |v|2  +  |w|2  —  |v  — w|2 

—  V1  +  v2  +  v3  +  W?  +  W2  +  W2  —  (Vi  —  Wi)2  —  (V2  —  W2)2  —  (V3  —  W3)2 
—  v 1  +  v2  +  v3  +  W\  +  W2  +  W2 

—  (v2  —  2viwi  +  w2)  —  {y\  —  2v2W2  +  w\)  —  (v2  —  2V3W3  +  w2) 

=  2viWi  +  2v2W2  +  2V3W3 
|v|  |w|  COS  0  —  V1W1  +  V2W2  +  V3W3 

cos0  =  (viwi  +  V2W2  T"  V3W3)/  ( |  v|  |  w| ) 

A  bit  of  simple  arithmetic  with  the  coordinates  of  v  and  w  allows  us  to  compute  the  cosine  of  the  angle 
between  them.  If  necessary  we  can  use  the  arccosine  to  get  0,  but  in  many  problems  cos  0  turns  out  to  be 
all  we  really  need. 

The  numerator  of  the  fraction  that  gives  us  cos  9  turns  up  a  lot,  so  we  give  it  a  name  and  more  compact 
notation:  we  call  it  the  dot  product,  and  write  it  as 

V  •  W  =  V|  W|  +  V2W2  +  V3W3 

This  is  the  same  symbol  we  use  for  ordinary  multiplication,  but  there  should  never  be  any  confusion;  you 
can  tell  from  context  whether  we  are  “multiplying”  vectors  or  numbers.  (We  might  also  use  the  dot  for 
scalar  multiplication:  a-  \  —  a\;  again,  it  is  clear  what  is  meant  from  context.) 


Solution.  We  know  that  cos0  =  v-w/(|v||w|)  =  (l-3  +  2-  l  +  l-  (— 5))/(|v||w|)  =  0,  so  9  —  k/2,  that  is, 
the  vectors  are  perpendicular.  X 
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Solution.  We  compute 

cos  9  =  (3-1  +  3-  0  +  0-0)/ (V9  +  9  +  0\/ 1+0  +  0) 

=  3/V^8  =  1/^2 

so  0  =  n/A.  X 

The  following  are  some  special  cases  worth  looking  at. 


Solution. 


1.  cos0  =  v-v/(|v||y|)  =  (v\  +  V2  +  v3)/(y^ vi+v2  +  v3\/vi  +v2  +  v3)  =  so  the  angle  between  v 
and  itself  is  zero,  which  of  course  is  correct. 

2.  cos  0  =  v  -  — v/(|v||  —  v|)  =  (— v\  —  V2  —  V3 )/{\Jv\  +  v^  +  V3  \Jv\  +  v^  +  v^)  =  —1,  so  the  angle  is  n, 
that  is,  the  vectors  point  in  opposite  directions,  as  of  course  we  already  knew. 

3.  cos0  =  v •  0/( | v|  |0| )  =  (0  +  0  +  0) / ( v\  +  v\  +  y/O2  +  02  +  02 ) ,  which  is  undefined.  On  the  other 
hand,  note  that  since  v  •  0  =  0  it  looks  at  first  as  if  cos  0  will  be  zero,  which  as  we  have  seen  means 
that  vectors  are  perpendicular;  only  when  we  notice  that  the  denominator  is  also  zero  do  we  run  into 
trouble.  One  way  to  “fix”  this  is  to  adopt  the  convention  that  the  zero  vector  0  is  perpendicular  to  all 
vectors;  then  we  can  say  in  general  that  if  v  ■  w  =  0,  v  and  w  are  perpendicular. 


X 


Generalizing  the  examples,  note  the  following  useful  facts: 

•  If  v  is  parallel  or  anti-parallel  to  w  then  v  •  w/(|v|  |w|)  =  ±1,  and  conversely,  if  v  ■  w/(|v|  |w|)  =  1, 
v  and  w  are  parallel,  while  if  v  •  w/(|v||w|)  =  — 1,  v  and  w  are  anti-parallel.  (Vectors  are  parallel  if 
they  point  in  the  same  direction,  anti-parallel  if  they  point  in  opposite  directions.) 

•  If  v  is  perpendicular  to  w  then  v  •  w/(|v|  |w|)  =  0,  and  conversely  if  v  ■  w/(|v|  |w|)  =0  then  v  and  w 
are  perpendicular. 

Given  two  vectors,  it  is  often  useful  to  find  the  projection  of  one  vector  onto  the  other,  because  this 
turns  out  to  have  important  meaning  in  many  circumstances.  More  precisely,  given  v  and  w,  we  seek  a 
vector  parallel  to  w  but  with  length  determined  by  v  in  a  natural  way,  as  shown  in  Figure  12.5.  p  is  chosen 
so  that  the  triangle  formed  by  v,  p,  and  v  —  p  is  a  right  triangle. 
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v 


w 


Figure  12.5:  p  is  the  projection  of  v  onto  w. 


Using  a  little  trigonometry,  we  see  that 


.  ,  V- W  V- w 

p  =  v  cost?  =  V  — — 7  =  - — - 

V  w  w 


this  is  sometimes  called  the  scalar  projection  of  v  onto  w.  To  get  p  itself,  we  multiply  this  length  by  a 
vector  of  length  one  parallel  to  w: 


V- W  W  V- w 

p=  =  r^:w- 

|w|  |w|  |w|z 


Be  sure  that  you  understand  why  w/|w|  is  a  vector  of  length  one  (also  called  a  unit  vector)  parallel  to  w. 

The  discussion  so  far  implicitly  assumed  that  0  <  0  <  k/2.  U'  k/2  <  0  <  k,  the  picture  is  like  Fig¬ 
ure  12.6.  In  this  case  v  •  w  is  negative,  so  the  vector 


v-  w 


is  anti-parallel  to  w,  and  its  length  is 


v-  w 

Iwl 


In  general,  the  scalar  projection  of  v  onto  w  may  be  positive  or  negative.  If  it  is  negative,  it  means  that 
the  projection  vector  is  anti-parallel  to  w  and  that  the  length  of  the  projection  vector  is  the  absolute  value 
of  the  scalar  projection.  Of  course,  you  can  also  compute  the  length  of  the  projection  vector  as  usual,  by 
applying  the  distance  formula  to  the  vector. 


^7  W 


V 

p 

Figure  12.6:  p  is  the  projection  of  v  onto  w. 


Note  that  the  phrase  “projection  onto  w”  is  a  bit  misleading  if  taken  literally;  all  that  w  provides  is  a 
direction;  the  length  of  w  has  no  impact  on  the  final  vector.  In  Figure  12.7,  for  example,  w  is  shorter  than 
the  projection  vector,  but  this  is  perfectly  acceptable. 
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Figure  12.7:  p  is  the  projection  of  v  onto  w. 


Physical  force  is  a  vector  quantity.  It  is  often  necessary  to  compute  the  “component”  of  a  force  acting 
in  a  different  direction  than  the  force  is  being  applied. 


Example  12.5:  Components  of  Force  Vector 


Suppose  a  ten  pound  weight  is  resting  on  an  inclined  plane — a  pitched  roof,  for  example.  Gravity 
exerts  a  force  of  ten  pounds  on  the  object,  directed  straight  down.  It  is  useful  to  think  of  the  compo¬ 
nent  of  this  force  directed  down  and  parallel  to  the  roof,  and  the  component  down  and  directly  into 
the  roof.  These  forces  are  the  projections  of  the  force  vector  onto  vectors  parallel  and  perpendicular 
to  the  roof.  Suppose  the  roof  is  tilted  at  a  30°  angle,  as  in  Figure  12.8.  Compute  the  component  of 
the  force  directed  down  the  roof  and  the  component  of  the  force  directed  into  the  roof. 


Solution.  A  vector  parallel  to  the  roof  is  ( — \/3 ,  —  1 ) ,  and  a  vector  perpendicular  to  the  roof  is  (1,  —  \/3). 
The  force  vector  is  F  =  (0,-10).  The  component  of  the  force  directed  down  the  roof  is  then 


Ft 


F- (-73,-1) 

|(-V/3,-l>|2 


(-V3.-1) 


10  (— x/3,  — 1) 
2  2 


(— 5v/3/2,  —5/2) 


with  length  5.  The  component  of  the  force  directed  into  the  roof  is 


,  F-  (1,— \/3)  ^  W3(l,-V3) 

=  i — 2— 


(5^/2, -15/2) 


with  length  5\/3.  Thus,  a  force  of  5  pounds  is  pulling  the  object  down  the  roof,  while  a  force  of  5\/3 
pounds  is  pulling  the  object  into  the  roof.  £ 


Figure  12.8:  Components  of  a  force. 
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The  dot  product  has  some  familiar-looking  properties  that  will  be  useful  later,  so  we  list  them  here. 
These  may  be  proved  by  writing  the  vectors  in  coordinate  form  and  then  performing  the  indicated  calcu¬ 
lations;  subsequently  it  can  be  easier  to  use  the  properties  instead  of  calculating  with  coordinates. 


Exercises  for  12.3 


Exercise  12.3.1  Find  (1, 1, 1)  •  (2,  —3,4). 

Exercise  12.3.2  Find  (1,2,0)  •  (0,0,57). 

Exercise  12.3.3  Find  (3,2, 1)  •  (0, 1,0). 

Exercise  12.3.4  Find  (-1,-2, 5)  ■  (1,0, —  1). 

Exercise  12.3.5  Find  (3,4,6)  •  (2,3,4). 

Exercise  12.3.6  Find  die  cosine  of  die  angle  between  (1,2,3)  and  (1, 1, 1);  use  a  calculator  if  necessary 
to  find  the  angle. 

Exercise  12.3.7  Find  the  cosine  of  the  angle  between  (-1,-2, —3)  and  (5,0,2);  use  a  calculator  if 
necessary  to  find  the  angle. 

Exercise  12.3.8  Find  the  cosine  of  the  angle  between  (47, 100,0)  and  (0,0,5);  use  a  calculator  if  neces¬ 
sary  to  find  the  angle. 

Exercise  12.3.9  Find  the  cosine  of  the  angle  between  (1,0, 1)  and  (0, 1, 1);  use  a  calculator  if  necessary 
to  find  the  angle. 

Exercise  12.3.10  Find  the  cosine  of  the  angle  between  (2,0,0)  and  (—1, 1,-1);  use  a  calculator  if  nec¬ 
essary  to  find  the  angle. 
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Exercise  12.3.11  Find  the  angle  between  the  diagonal  of  a  cube  and  one  of  the  edges  adjacent  to  the 
diagonal. 

Exercise  12.3.12  Find  the  scalar  and  vector  projections  of  { 1,2,3)  onto  (1,2,0). 

Exercise  12.3.13  Find  the  scalar  and  vector  projections  of  (l,  1, 1)  onto  (3,2, 1). 

Exercise  12.3.14  A  force  of  10  pounds  is  applied  to  a  wagon,  directed  at  an  angle  of  30°.  Find  the 
component  of  this  force  pulling  the  wagon  straight  up,  and  the  component  pulling  it  horizontally  along  the 
ground. 


F 


Figure  12.9:  Pulling  a  wagon. 


Exercise  12.3.15  A  force  of  15  pounds  is  applied  to  a  wagon,  directed  at  an  angle  of  45°.  Find  the 
component  of  this  force  pulling  the  wagon  straight  up,  and  the  component  pulling  it  horizontally  along  the 
ground. 

Exercise  12.3.16  Use  the  dot  product  to  find  a  non-zero  vector  w  perpendicular  to  both  u  —  (1,2,  —3)  and 
v  =  (2,0,1). 

Exercise  12.3.17  Let  x  —  (1, 1,0)  and y  =  (2,4,2).  Find  a  unit  vector  that  is  perpendicular  to  both  x  and 

y- 

Exercise  12.3.18  Do  the  three  points  (1,2,0),  (—2, 1, 1),  and  (0,3,  — 1  )form  a  right  triangle? 

Exercise  12.3.19  Do  the  three  points  (1,1,1),  (2,3,2),  and  (5,0,  —  1)  form  a  right  triangle? 

Exercise  12.3.20  Show  that  |v •  w\  <  |v|  |w| 

Exercise  12.3.21  Let  x  and  y  be  perpendicular  vectors.  Use  Theorem  12.6  to  prove  that  |jc|2  +  |j|2  = 
|jc  +y\2-  What  is  this  result  better  known  as? 

Exercise  12.3.22  Prove  that  the  diagonals  of  a  rhombus  intersect  at  right  angles. 

Exercise  12.3.23  Suppose  that  z  —  |x|j  +  [y|x  where  x,  y,  and  z  are  all  nonzero  vectors.  Prove  that  z 
bisects  the  angle  between  x  and  y. 


Exercise  12.3.24  Prove  Theorem  12.6. 
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12.4  The  Cross  Product 


Suppose  we  are  given  two  vectors.  In  many  cases  it  is  useful  to  find  a  third  vector  perpendicular  to  the 
first  two.  There  are  of  course  an  infinite  number  of  such  vectors  of  different  lengths.  Nevertheless,  let 
us  find  one.  Suppose  v  =  (v1.v2.v3)  and  w  =  (w1.w2.w3).  We  want  to  find  a  vector  c  =  (c1.c2.c3)  with 
c  •  v  =  c  •  w  =  0,  or 


Vjci  +V2C2  +  V3C3  =  0, 

W1C1  +  W2C2  +  W3C3  =  0. 

Multiply  the  first  equation  by  W3  and  the  second  by  V3  and  subtract  to  get 

W3V1C1  +  W3V2C2  +  W3V3C3  =  0 
V3W1C1  +  V3W2C2  +  V3W3C3  =  0 
(V1W3  —  WiV3)ci  +  (V2W3  —  W2V3)c2  =  0 

Of  course,  this  equation  in  two  variables  has  many  solutions;  a  particularly  easy  one  to  see  is  ci  =  V2W3  — 
W2V3,  C2  =  W1V3  —  V1W3.  Substituting  back  into  either  of  the  original  equations  and  solving  for  C3  gives 
C3  =  V1W2  —  W1V2. 

This  particular  answer  to  the  problem  turns  out  to  have  some  nice  properties,  and  it  is  dignified  with  a 
name:  the  cross  product: 


VXW  =  (V2W3  —  W2V3.W1V3  —  ViW3,ViW2  —  W1V2). 


While  there  is  a  nice  pattern  to  this  vector,  it  can  be  a  bit  difficult  to  memorize;  here  is  a  convenient 
mnemonic.  The  determinant  of  a  two  by  two  matrix  is 


a  b 
c  d 


ad  —  cb. 


This  is  extended  to  the  determinant  of  a  three  by  three  matrix: 


x 

vi 

Wi 


y 

z 

V2 

V3 

w2 

W3 

V3 

W3 


V3 

W3 


Vi 

Wi 


V2 

w2 


=  x(v2W3  —  W2V3)  — y(viW3  —  W1V3)  +  z(v\W2 
=  x(v2W3  —  W2V3)  +y(wiV3  —  V1W3)  +  z(viW2 


Wi  v2) 
Wl  v2). 


Each  of  the  two  by  two  matrices  is  formed  by  deleting  the  top  row  and  one  column  of  the  three  by  three 
matrix;  the  subtraction  of  the  middle  term  must  also  be  memorized.  This  is  not  the  place  to  extol  the  uses  of 
the  determinant;  suffice  it  to  say  that  determinants  are  extraordinarily  useful  and  important.  Here  we  want 
to  use  it  merely  as  a  mnemonic  device.  You  will  have  noticed  that  the  three  expressions  in  parentheses  on 
the  last  line  are  precisely  the  three  coordinates  of  the  cross  product;  replacing  x,  y,  z  by  i,  j,  k  gives  us 


i  j  k 

Vi  V2  v3 
W 1  W2  W3 


(V2W3  —  W2V3)i  —  (V1W3  —  W |  V3 ) j  +  (V1W2  —  WiV2)k 
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=  (V2W3  —  W2V3)i+  (VV1V3  —  ViW3)j  +  (v\W2  —  Viqv2)k 
=  (V2W3  —  W2V3,WiV3  —  V\W'i,V\W2  ~  W1V2) 

=  YXW. 


Given  v  and  w,  there  are  typically  two  possible  directions  and  an  infinite  number  of  magnitudes  that 
will  give  a  vector  perpendicular  to  both  v  and  w.  As  we  have  picked  a  particular  one,  we  should  investigate 
the  magnitude  and  direction. 

We  know  how  to  compute  the  magnitude  of  v  x  w;  it’s  a  bit  messy  but  not  difficult.  It  is  somewhat 
easier  to  work  initially  with  the  square  of  the  magnitude,  so  as  to  avoid  the  square  root: 

|v  X  w|2  =  (V2W3  —  W2V3)2  +  (W1V3  —  V’l  W3)2  +  (V1W2  —  W1V2)2 

99  9999  9999  99 

=  v2W3  —  2V2W3W2V3  +  w2v 3  4-WjV 3  —  2W1V3V1W3  +  +  vlw2  —  2V1W2W1V2  +  wqv2 


While  it  is  far  from  obvious,  this  nasty  looking  expression  can 


|v  X  w|  —  (Vj  T  V2  T  V3)  (\V  j  W2  4"  W3)  — 


=  |v 
=  |v 
=  |v 
=  |v 
|v  X  w|  =  |v 


w|2  -  (y  ■  w)2 


2 
2 

2 1  w|^(  1  —  COS“  0) 
2|w|2sin2  0 
Iwl  sin0 


w|  —  |v|  |w|  cos  0 
2/ 


be  simplified: 

(v’lWl  +  V2W2  +  V3W3)2 


The  magnitude  of  v  x  w  is  thus  very  similar  to  the  dot  product.  In  particular,  notice  that  if  v  is  parallel 
to  w,  the  angle  between  them  is  zero,  so  sin  6  —  0,  so  |v  x  w|  =  0,  and  likewise  if  they  are  anti-parallel, 
sin  0=0,  and  |v  x  w|  =  0.  Conversely,  if  |  v  x  w|  =0  and  |v|  and  |w|  are  not  zero,  it  must  be  that  sin  0=0, 
so  v  is  parallel  or  anti-parallel  to  w. 

Here  is  a  curious  fact  about  this  quantity  that  turns  out  to  be  quite  useful  later  on:  Given  two  vectors, 
we  can  put  them  tail  to  tail  and  form  a  parallelogram,  as  in  Figure  12.10.  The  height  of  the  parallelogram, 
h,  is  |v|  sin  0,  and  the  base  is  |w|,  so  the  area  of  the  parallelogram  is  |v|  |w|  sin  0,  exactly  the  magnitude  of 
|v  X  wl. 


Figure  12.10:  A  parallelogram. 


What  about  the  direction  of  the  cross  product?  Remarkably,  there  is  a  simple  rule  that  describes  the 
direction.  Let’s  look  at  a  simple  example:  Let  v  =  (a,  0,0),  w  =  (b,c,  0).  If  the  vectors  are  placed  with 
tails  at  the  origin,  v  lies  along  the  x-axis  and  w  lies  in  the  x-y  plane,  so  we  know  the  cross  product  will 
point  either  up  or  down.  The  cross  product  is 


i 

a 


b 


j  k 

0  0 
c  0 


(0,0,ac). 


V  X  w  = 


c 
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As  predicted,  this  is  a  vector  pointing  up  or  down,  depending  on  the  sign  of  ac.  Suppose  that  a  >  0,  so 
the  sign  depends  only  on  c:  if  c  >  0,  ac  >  0  and  the  vector  points  up;  if  c  <  0,  the  vector  points  down. 
On  the  other  hand,  if  a  <  0  and  c  >  0,  the  vector  points  down,  while  if  a  <  0  and  c  <  0,  the  vector  points 
up.  Here  is  how  to  interpret  these  facts  with  a  single  rule:  Imagine  rotating  vector  v  until  it  points  in  the 
same  direction  as  w;  there  are  two  ways  to  do  this — use  the  rotation  that  goes  through  the  smaller  angle. 
If  a  >  0  and  c  >  0,  or  a  <  0  and  c  <  0,  the  rotation  will  be  counter-clockwise  when  viewed  from  above; 
in  the  other  two  cases,  v  must  be  rotated  clockwise  to  reach  w.  The  rule  is:  counter-clockwise  means  up, 
clockwise  means  down.  If  v  and  w  are  any  vectors  in  the  x-y  plane,  the  same  rule  applies — v  need  not  be 
parallel  to  the  x-axis. 

Although  it  is  somewhat  difficult  computationally  to  see  how  this  plays  out  for  any  two  starting  vectors, 
the  rule  is  essentially  the  same.  Place  v  and  w  tail  to  tail.  The  plane  in  which  v  and  w  lie  may  be  viewed 
from  two  sides;  view  it  from  the  side  for  which  v  must  rotate  counter-clockwise  to  reach  w;  then  the  vector 
v  x  w  points  toward  you. 

This  rule  is  usually  called  the  right  hand  rule.  Imagine  placing  the  heel  of  your  right  hand  at  the  point 
where  the  tails  are  joined,  so  that  your  slightly  curled  fingers  indicate  the  direction  of  rotation  from  v  to 
w.  Then  your  thumb  points  in  the  direction  of  the  cross  product  v  x  w. 

One  immediate  consequence  of  these  facts  is  that  vxw^wxv,  because  the  two  cross  products  point 
in  the  opposite  direction.  On  the  other  hand,  since 

|v  x  w|  =  |v||  w|  sin0  =  |w||v|  sin0  =  |w  x  v|, 

the  lengths  of  the  two  cross  products  are  equal,  so  we  know  that  vxw  =  -(wxv). 

The  cross  product  has  some  familiar-looking  properties  that  will  be  useful  later,  so  we  list  them  here. 
As  with  the  dot  product,  these  can  be  proved  by  performing  the  appropriate  calculations  on  coordinates, 
after  which  we  may  sometimes  avoid  such  calculations  by  using  the  properties. 


Exercises  for  12.4 


Exercise  12.4.1  Find  the  cross  product  of  (1, 1, 1)  and  (1,2,3). 
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Exercise  12.4.2  Find  the  cross  product  of  (1,0,2)  and  (-1,-2, 4). 

Exercise  12.4.3  Find  the  cross  product  of  (—2, 1,3)  and  (5,2,  —  1). 

Exercise  12.4.4  Find  the  cross  product  of  { 1,0,0)  and  (0,0, 1). 

Exercise  12.4.5  Two  vectors  u  and  v  are  separated  by  an  angle  of  K / 6,  and  \u\  —  2  and  |v|  =3.  Find 
| u  x  v|. 

Exercise  12.4.6  Two  vectors  u  and  v  are  separated  by  an  angle  of  Ji / A,  and  |m|  =  3  and  |v|  —1.  Find 
I U  X  v|. 

Exercise  12.4.7  Find  the  area  of  the  parallelogram  with  vertices  (0,0),  (1,2),  (3,7),  and  (2,5). 

Exercise  12.4.8  Find  and  explain  the  value  of  (, i  xj)  x  k  and  (, i  +  j )  x  (i  —j). 

Exercise  12.4.9  Prove  that  for  all  vectors  u  and  v,  («xv)-v  =  0. 

Exercise  12.4.10  Prove  Theorem  12.7. 

Exercise  12.4.11  Define  the  triple  product  of  three  vectors,  x,  y,  and  z,  to  be  the  scalar  x  ■  (y  x  z).  Show 
that  three  vectors  lie  in  the  same  plane  if  and  only  if  their  triple  product  is  zero.  Verify  that  (1,5,— 2), 
(4,3,0)  and  (6, 13,-4)  all  lie  in  the  same  plane. 


12.5  Lines  and  Planes 


Lines  and  planes  are  perhaps  the  simplest  of  curves  and  surfaces  in  three  dimensional  space.  They  also 
will  prove  important  as  we  seek  to  understand  more  complicated  curves  and  surfaces. 

You  may  recall  that  the  equation  of  a  line  in  two  dimensions  is  ax  +  by  —  c;  it  is  reasonable  to  expect 
that  a  line  in  three  dimensions  is  given  by  ax  +  by  +  cz  —  d.  However  it  turns  out  that  this  is  the  equation 
of  a  plane.  We  will  turn  our  attention  to  a  study  of  planes  and  return  to  consider  lines  later  in  this  section. 

A  plane  does  not  have  an  obvious  “direction”  as  does  a  line.  It  is  possible  to  associate  a  plane  with 
a  direction  in  a  very  useful  way,  however:  there  are  exactly  two  directions  perpendicular  to  a  plane.  Any 
vector  with  one  of  these  two  directions  is  called  normal  to  the  plane.  While  there  are  many  normal  vectors 
to  a  given  plane,  they  are  all  parallel  or  anti-parallel  to  each  other. 

Suppose  two  points  (v1.v2.v3)  and  (wi,W2,W3)  are  in  a  plane;  then  the  vector  {w\  —  v\,W2  —  V2,wj  —  V3) 
is  parallel  to  the  plane.  In  particular,  if  this  vector  is  placed  with  its  tail  at  (v1.v2.v3)  then  its  head  is  at 
(wi ,  W2,  W3)  and  it  lies  in  the  plane.  As  a  result,  any  vector  perpendicular  to  the  plane  is  perpendicular 
to  (wi  —  vi,W2  —  V2.W3  —  V3).  In  fact,  it  is  easy  to  see  that  the  plane  consists  of  precisely  those  points 
(vvi,W2,W3)  for  which  (wi  —  vi,W2  —  V2,W3  —  V3)  is  perpendicular  to  a  normal  to  the  plane,  as  indicated  in 
Figure  12. 1 1.  Turning  this  around,  suppose  we  know  that  (a,  b,  c)  is  normal  to  a  plane  containing  the  point 
(v1.v2.v3).  Then  (x,y,z)  is  in  the  plane  if  and  only  if  ( a,b,c )  is  perpendicular  to  (x—v\,y  —  V2,z  —  vf).  In 
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turn,  we  know  that  this  is  true  precisely  when  ( a,b,c )  ■  (x  —  vi,y»  v2,z  —  V3)  =0.  That  is,  (x,y,z)  is  in  the 
plane  if  and  only  if 


(a,b,c)  ■  (x-vi,y-v2,z-v3)  =  0 
a(x  —  v\)  +  b(y-v2)  +c(z  —  V3)  =  0 
ax  +  by  +  cz  —  av  1  —  bv2  —  cv  3  =  0 

ax  +  by  +  cz  —  av  1  +  bv2  +  cv  3 . 

Working  backwards,  note  that  if  (. x,y,z )  is  a  point  satisfying  ax  +  by  +  cz  —  d  then 

ax  +  by  +  cz  —  d 
ax  +  by  +  cz  —  d  =  0 
a(x  —  d/a)  +b(y  —  0)  +c(z  —  0)  =  0 
(a,b,c)  ■  (x  —  d/a,y,z)  —  0. 

Namely,  ( a,b,c )  is  perpendicular  to  the  vector  with  tail  at  (d/a,  0,0)  and  head  at  (x,y,z).  This  means  that 
the  points  (x,y,z)  that  satisfy  the  equation  ax  +  by  +  cz  —  d  form  a  plane  perpendicular  to  (a, b. c) .  (This 
doesn’t  work  if  a  —  0,  but  in  that  case  we  can  use  b  or  c  in  the  role  of  a.  That  is,  either  a(x  —  0)  +  b(y  — 
d/b)  -T  c(z  —  0)  =  0  or  a(x  —  0)  +b(y  —  0)  +  c(z  —  d/c)  —  0.) 


Figure  12.11:  A  plane  defined  via  vectors  perpendicular  to  a  normal. 

Thus,  given  a  vector  (a,  b,  c)  we  know  that  all  planes  perpendicular  to  this  vector  have  the  form  ax  + 
by-\-cz  —  d,  and  any  surface  of  this  form  is  a  plane  perpendicular  to  (a,  b,  c) . 
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Example  12.8:  Perpendicular  Plane 


Find  an  equation  for  the  plane  perpendicular  to  (1,2,3)  and  containing  the  point  (5,0,7). 


Solution.  Using  the  formula  above,  the  plane  is  Ijc  +  2 y  +  3 z  =  d.  To  find  d  we  may  substitute  the  known 
point  on  the  plane  to  get  5  +  2  •  0  +  3  •  7  =  d,  so  d  —  26.  X 


Example  12.9:  Normal  Vector 


Find  a  vector  normal  to  the  plane  2x  —  3y  +  z  —  15. 


Solution.  One  example  is  (2, —3,1).  Any  vector  parallel  or  anti-parallel  to  this  works  as  well,  so  for 
example  —2(2,  —3, 1)  =  (—4,6,  —2)  is  also  normal  to  the  plane.  X 

We  will  frequently  need  to  find  an  equation  for  a  plane  given  certain  information  about  the  plane. 
While  there  may  occasionally  be  slightly  shorter  ways  to  get  to  the  desired  result,  it  is  always  possible, 
and  usually  advisable,  to  use  the  given  information  to  find  a  normal  to  the  plane  and  a  point  on  the  plane, 
and  then  to  find  the  equation  as  above. 


Example  12.10:  Plane  Perpendicular 


The  planes  x  —  z  —  1  and  y  +  2z  —  3  intersect  in  a  line.  Find  a  third  plane  that  contains  this  line  and 
is  perpendicular  to  the  plane  x  +  y  —  2z—  1 . 


Solution.  First,  we  note  that  two  planes  are  perpendicular  if  and  only  if  their  normal  vectors  are  perpen¬ 
dicular.  Thus,  we  seek  a  vector  ( a,h,c )  that  is  perpendicular  to  (1, 1,-2).  In  addition,  since  the  desired 
plane  is  to  contain  a  certain  line,  (a,  b,  c)  must  be  perpendicular  to  any  vector  parallel  to  this  line.  Since 
(a,  b,  c)  must  be  perpendicular  to  two  vectors,  we  may  find  it  by  computing  the  cross  product  of  the  two. 

Therefore  we  need  a  vector  parallel  to  the  line  of  intersection  of  the  given  planes.  For  this,  it  suffices  to 
know  two  points  on  the  line.  To  find  two  points  on  this  line,  we  must  find  two  points  that  are  simultaneously 
on  the  two  planes,  x  —  z—1  and  y  +  2z  —  3.  Any  point  on  both  planes  will  satisfy  x  —  z  —  1  and  y  +  2z  —  3. 
It  is  easy  to  find  values  for  x  and  z  satisfying  the  first,  such  as  x  —  l,z  —  0  and  x  —  2 ,z—  1.  Then  we  can 
find  corresponding  values  for  y  using  the  second  equation,  namely  y  —  3  and  y  =  1,  so  (1,3,0)  and  (2, 1, 1) 
are  two  such  points.  They  are  both  on  the  line  of  intersection  since  they  are  contained  in  both  planes. 

Now  (2 — 1,1  —  3, 1  — 0)  =  (1,— 2, 1)  is  parallel  to  the  line.  Finally,  we  may  choose  ( a,b,c )  =  (1,1,— 2)  x 
(1,-2, 1)  =  (—3,  —3,  —3).  While  this  vector  will  do  perfectly  well,  any  vector  parallel  or  anti-parallel  to 
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it  will  work  as  well.  For  example  we  might  choose  (1,1,1)  which  is  anti-parallel  to  it,  and  easier  to  work 
with. 

Now  we  know  that  (1,1,1)  is  normal  to  the  desired  plane  and  (2, 1, 1)  is  a  point  on  the  plane.  This 
gives  an  equation  of  (l)jc-|-  (l)>'-(-  (l)z  =  Substituting  the  value  of  the  point  into  the  equation  gives 
d  =  4,  and  therefore  an  equation  of  the  plane  is  x  +  y  +  z  —  4.  Asa  quick  check,  since  (1,3,0)  is  also  on 
the  line,  it  should  be  on  the  plane;  since  1  +  3  +  0  =  4,  we  see  that  this  is  indeed  the  case. 

Note  that  had  we  used  (—3,  —3,  —3)  as  the  normal,  we  would  have  discovered  the  equation  —  3x  —  3y  — 
3 z  —  —12.  Then  we  might  well  have  noticed  that  we  could  divide  both  sides  by  —3  to  get  the  equivalent 

x  +  y  +  z  =  4.  4 

We  will  now  turn  our  attention  to  a  study  of  lines.  Unfortunately,  it  turns  out  to  be  quite  inconvenient 
to  represent  a  typical  line  with  a  single  equation;  we  need  to  approach  lines  in  a  different  way. 

Unlike  a  plane,  a  line  in  three  dimensions  does  have  an  obvious  direction,  namely,  the  direction  of  any 
vector  parallel  to  it.  In  fact  a  line  can  be  defined  and  uniquely  identified  by  providing  one  point  on  the  line 
and  a  vector  parallel  to  the  line  (in  one  of  two  possible  directions).  That  is,  the  line  consists  of  exactly 
those  points  we  can  reach  by  starting  at  the  point  and  going  for  some  distance  in  the  direction  of  the  vector. 
Let’s  see  how  we  can  translate  this  into  more  mathematical  language. 

Suppose  a  line  contains  the  point  (vi,V2,V3)  and  is  parallel  to  the  vector  ( a,b,c ).  If  we  place  the  vector 
(vi, V2, V3)  with  its  tail  at  the  origin  and  its  head  at  (vi,V2,V3),  and  if  we  place  the  vector  ( a,b,c )  with  its 
tail  at  (v1.v2.v3),  then  the  head  of  ( a,b,c )  is  at  a  point  on  the  line.  We  can  get  to  any  point  on  the  line  by 
doing  the  same  thing,  except  using  t(a,b,c)  in  place  of  ( a,b,c ),  where  t  is  some  real  number.  Because  of 
the  way  vector  addition  works,  the  point  at  the  head  of  the  vector  t{a,b,c )  is  the  point  at  the  head  of  the 
vector  (v1.v2.v3)  +  t(a,b,c),  namely  (vi  +ta,V2  +tb,vs  +tc);  see  Figure  12.12. 


Figure  12.12:  Vector  form  of  a  line. 


In  other  words,  as  t  runs  through  all  possible  real  values,  the  vector  (v1.v2.v3)  +t{a,b,c)  points  to 
every  point  on  the  line  when  its  tail  is  placed  at  the  origin.  It  is  occasionally  useful  to  use  this  form  of  a 
line  even  in  two  dimensions;  a  vector  form  for  a  line  in  thex-y  plane  is  (vi,V2)  +t{a,b),  which  is  the  same 
as  (vi,v2,0)  +t{a,b,0). 


The  vector  (a,  b,  c)  is  called  the  direction  vector  for  the  line. 
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Another  common  way  to  write  this  is  as  a  set  of  parametric  equations: 

x  —  vi+ta  y  —  vi  +  tb  z  —  v^+tc. 


Example  12.11:  Vector  Expression 


Find  a  vector  expression  for  the  line  through  (6, 1 ,  — 3)  and  (2, 4,5). 


Solution.  To  get  a  vector  parallel  to  the  line  we  subtract  (6, 1,-3)  —  (2,4,5)  =  (4, —3,— 8).  The  line  is 
then  given  by  (2,4,5)  +  t(4,  —  3,—  8);  there  are  of  course  many  other  possibilities,  such  as  (6, 1,-3) 
t(4,  —3,  —8). 


Example  12.12:  Intersecting  Lines 


Determine  whether  the  lines  (1, 1, 1)  +t(  1,2,  — 1)  and  (3,2, 1)  +  /(—!,  -5, 3)  are  parallel,  intersect, 
or  neither. 


Solution.  In  two  dimensions,  two  lines  either  intersect  or  are  parallel;  in  three  dimensions,  lines  that  do 
not  intersect  might  not  be  parallel.  In  this  case,  since  the  direction  vectors  for  the  lines  are  not  parallel  or 
anti-parallel  we  know  the  lines  are  not  parallel.  If  they  intersect,  there  must  be  two  values  a  and  b  so  that 
(1, 1, 1)  +  a(l,2,  —  1)  =  (3,2, 1)  -t-Z?( — 1,  — 5,3).  That  is,  the  following  must  have  a  solution: 

1  +a  —  3  —  b 
l+2a  =  2  —  5b 
1  — a  —  1  +  3  b 

This  gives  three  equations  in  two  unknowns,  so  there  may  or  may  not  be  a  solution  in  general.  In  this  case, 
it  is  easy  to  discover  that  a  =  3  and  b  =  —  1  satisfies  all  three  equations.  Substituting  these  values  into 
(1, 1, 1)  +a(l,2,  — 1)  =  (3,2, 1)  +  b(—  1,—  5,3),  the  point  of  intersection  is  (4,7,— 2).  X 


Example  12.13:  Distance  from  a  Point  to  a  Plane 


Find  the  distance  from  the  point  ( 1 , 2, 3)  to  the  plane  2x  —  y  +  3z  —  5. 


*  + 
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Solution.  The  distance  from  a  point  P  to  a  plane  is  the  shortest  distance  from  P  to  any  point  on  the  plane; 
this  is  the  distance  measured  from  P  perpendicular  to  the  plane;  see  Figure  12.13.  This  distance  is  the 

absolute  value  of  the  scalar  projection  of  QP  onto  a  normal  vector  n,  where  Q  is  any  point  on  the  plane.  It 
is  easy  to  find  a  point  on  the  plane,  say  (1,0, 1).  Thus  the  distance  is 


(0,2,2) -(2,-1, 3)  _  4 
|(2, -1,3)|  vT4 


X 


'\ 

/ 

s  I 

I 

I 

I 


Figure  12.13:  Distance  from  a  point  to  a  plane. 


Solution.  Again  we  want  the  distance  measured  perpendicular  to  the  line,  as  indicated  in  Figure  12.14. 
The  desired  distance  is 


\QP\  sin0 


QP  x  v 

Ivl 


where  v  is  any  vector  parallel  to  the  line.  From  the  equation  of  the  line,  we  can  use  Q  =  (1,1,1)  and 
v  =  (2,3,  — 1)  along  withP=  (  — 1,2, 1),  so  the  distance  is 


| (—2, 1,0)  x  (2,3,-!)]  =  |(— 1,-2,— 8)|  =  ^69 
y/u  vT4  vT4‘ 


X 


Figure  12.14:  Distance  from  a  point  to  a  line. 
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Exercises  for  12.5 

Exercise  12.5.1  Find  an  equation  of  the  plane  containing  (6,2, 1)  and  perpendicular  to  (1, 1, 1). 

Exercise  12.5.2  Find  an  equation  of  the  plane  containing  (  —  1,2, —3)  and  perpendicular  to  (4,5,  — 1). 

Exercise  12.5.3  Find  an  equation  of  the  plane  containing  (1,2, —3),  (0, 1,-2)  and  (1,2, —2). 

Exercise  12.5.4  Find  an  equation  of  the  plane  containing  (1,0,0),  (4,2,0)  and  (3,2, 1). 

Exercise  12.5.5  Find  an  equation  of  the  plane  containing  (1,0,0)  and  the  line  (1,0,2)  + 1  (3, 2, 1). 

Exercise  12.5.6  Find  an  equation  of  the  plane  containing  the  line  of  intersection  ofx  +  y  +  z  —  1  and 
x  —  y  +  2z  —  2,  and  perpendicular  to  the  x-y  plane. 

Exercise  12.5.7  Find  an  equation  of  the  line  through  (1,0,3)  and  (1,2,4). 

Exercise  12.5.8  Find  an  equation  of  the  line  through  (1,0,3)  and  perpendicular  to  the  plane  x  +  2y  —  z  — 
1. 

Exercise  12.5.9  Find  an  equation  of  the  line  through  the  origin  and  perpendicular  to  the  plane  x-\-y  —  z  — 
2. 

Exercise  12.5.10  Find  a  and  c  so  that  (a,  l,c)  is  on  the  line  through  (0,2,3)  and  (2,7,5). 

Exercise  12.5.11  Explain  how  to  discover  the  solution  in  Example  12.12. 

Exercise  12.5.12  Determine  whether  the  lines  (1,3,  — 1)  +l(l,  1,0)  and  (0,0,0)  +  ?(1,4,5)  are  parallel, 
intersect,  or  neither. 

Exercise  12.5.13  Determine  whether  the  lines  (1,0,2)  +7  (—1,  —  1,2)  and  (4,4,2)  +/(2,2,  —4)  are  paral¬ 
lel,  intersect,  or  neither. 

Exercise  12.5.14  Determine  whether  the  lines  (1,2,  —  1)  +/(1,2, 3)  and  (1,0, 1)  +1  (2/3, 2, 4/3)  are  par¬ 
allel,  intersect,  or  neither. 

Exercise  12.5.15  Determine  whether  the  lines  (1, 1,2)  +t  (1,2,  —3)  and  (2,3,  —  1)  +l(2,4,  —6)  are  paral¬ 
lel,  intersect,  or  neither. 

Exercise  12.5.16  Find  a  unit  normal  vector  to  each  of  the  coordinate  planes. 

Exercise  12.5.17  Show  that  (2, 1,3)  +t(  1, 1,2)  and  (3,2,5)  +s( 2,2,4)  are  the  same  line. 


Exercise  12.5.18  Give  a  prose  description  for  each  of  the  following  processes: 
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(a)  Given  two  distinct  points,  find  the  line  that  goes  through  them. 

(b)  Given  three  points  (not  all  on  the  same  line),  find  the  plane  that  goes  through  them.  Why  do  we  need 
the  caveat  that  not  all  points  be  on  the  same  line? 

(c)  Given  a  line  and  a  point  not  on  the  line,  find  the  plane  that  contains  them  both. 

( d)  Given  a  plane  and  a  point  not  on  the  plane,  find  the  line  that  is  perpendicular  to  the  plane  through 
the  given  point. 


Exercise  12.5.19 
Exercise  12.5.20 
Exercise  12.5.21 
Exercise  12.5.22 
Exercise  12.5.23 
Exercise  12.5.24 


Find  the  distance  from  (2,2,2)  to  x-\-y  +  z  =  — 1. 

Find  the  distance  from  (2,  —  1 ,  —  1 )  to  2x  —  3y  +  z  —  2. 

Find  the  distance  from  (2,-1, 1)  to  (2,2,0)  +  t{  1,2,3). 

Find  the  distance  from  (1,0, 1)  to  (3,2, 1)  +  t( 2, -1,-2). 

Find  the  cosine  of  the  angle  between  the  planes  x  +  y  +  z  —  2  and  x  +  2y  +  3z  =  8. 
Find  the  cosine  of  the  angle  between  the  planes  x  ~  y  +  2z  =  2  and  3x  —  2 y  +  z  —  5. 


12.6  Other  Coordinate  Systems 


Coordinate  systems  are  tools  that  let  us  use  algebraic  methods  to  understand  geometry.  While  the  rect¬ 
angular  (also  called  Cartesian)  coordinates  that  we  have  been  discussing  are  the  most  common,  some 
problems  are  easier  to  analyze  in  alternate  coordinate  systems. 

A  coordinate  system  is  a  scheme  that  allows  us  to  identify  any  point  in  the  plane  or  in  three-dimensional 
space  by  a  set  of  numbers.  In  rectangular  coordinates  these  numbers  are  interpreted,  roughly  speaking,  as 
the  lengths  of  the  sides  of  a  rectangular  “box.” 

In  two  dimensions  you  may  already  be  familiar  with  an  alternative,  called  polar  coordinates.  In  this 
system,  each  point  in  the  plane  is  identified  by  a  pair  of  numbers  (r,  0).  The  number  0  measures  the 
counter-clockwise  angle  between  the  positive  x-axis  and  a  vector  with  tail  at  the  origin  and  head  at  the 
point,  as  shown  in  Figure  12.15;  the  number  r  measures  the  distance  from  the  origin  to  the  point.  Either 
of  these  may  be  negative;  a  negative  0  indicates  the  angle  is  measured  clockwise  from  the  positive  x-axis 
instead  of  counter-clockwise,  and  a  negative  r  indicates  the  point  at  distance  \r\  in  the  opposite  of  the 
direction  given  by  0. 

The  relationship  between  polar  and  rectangular  coordinates  is  given  by 


x  =  rcos0 
y  =  rsin0 
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and 


r=  y/ x2  +  y1 

tan  9  =  - 
x 


Example  12.15:  Rectangular  to  Polar  Coordinates 


Convert  the  point  (x,y)  —  ( 1 ,  \/3)  into  polar  coordinates. 


Solution.  First  calculate  r. 

r  —  \J x2  +  y2  =  VTT3  =  2 

Now  find  9  such  that  tan0  =  The  required  9  is  y. 

The  polar  coordinates  are  (2,  |).  £ 

Figure  12.15  also  shows  the  point  with  rectangular  coordinates  (l,\/3)  and  polar  coordinates  (2,  tt/3), 

2  units  from  the  origin  and  k/ 3  radians  from  the  positive  v-axis. 


(2,n/3) 


Figure  12.15:  Polar  coordinates:  the  general  case  and  the  point  with  rectangular  coordinates 

(1V3). 


We  can  extend  polar  coordinates  to  three  dimensions  simply  by  adding  a  z  coordinate;  this  is  called 
cylindrical  coordinates.  Each  point  in  three-dimensional  space  is  represented  by  three  coordinates 
(r,  9,z )  in  the  obvious  way:  this  point  is  z  units  above  or  below  the  point  (r,  9)  in  the  x-y  plane,  as  shown 
in  Figure  12.16.  The  point  with  rectangular  coordinates  (l,\/3,3)  and  cylindrical  coordinates  (2, tt/3, 3) 
is  also  indicated  in  Figure  12.16. 
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Figure  12.16:  Cylindrical  coordinates:  the  general  case  and  the  point  with  rectangular  coordinates 

(Kv'3-3). 


Some  figures  with  relatively  complicated  equations  in  rectangular  coordinates  will  be  represented  by 
simpler  equations  in  cylindrical  coordinates.  For  example,  the  cylinder  in  Figure  12.17  has  equation 
x2  +y2  —  4  in  rectangular  coordinates,  but  equation  r  —  2  in  cylindrical  coordinates. 


Figure  12.17:  The  cylinder  r  =  2. 


Given  a  point  ( r,  0 )  in  polar  coordinates,  it  is  easy  to  see  (as  in  Figure  12.15)  that  the  rectangular 
coordinates  of  the  same  point  are  (rcos  0,rsin  0),  and  so  the  point  (r,  6,z)  in  cylindrical  coordinates  is 
(rcos0,rsin0,z)  in  rectangular  coordinates.  This  means  it  is  usually  easy  to  convert  any  equation  from 
rectangular  to  cylindrical  coordinates:  simply  substitute 

x  —  rcos 6 
y  —  r  sin0 

and  leave  z  alone.  For  example,  starting  with  x2+y2  —  4  and  substituting  x  =  r  cos  0,  y  —  r  sin  0  gives 

r2  cos2  0  +  r2  sin2  0=4 
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r2(cos20  +  sin20)  —4 
r2  —  4 
r  =  2. 


Of  course,  it’s  easy  to  see  directly  that  this  defines  a  cylinder  as  mentioned  above. 

Cylindrical  coordinates  are  an  obvious  extension  of  polar  coordinates  to  three  dimensions,  but  the 
use  of  the  z  coordinate  means  they  are  not  as  closely  analogous  to  polar  coordinates  as  another  standard 
coordinate  system.  In  polar  coordinates,  we  identify  a  point  by  a  direction  and  distance  from  the  origin; 
in  three  dimensions  we  can  do  the  same  thing,  in  a  variety  of  ways.  The  question  is:  how  do  we  represent 
a  direction?  One  way  is  to  give  the  angle  of  rotation,  0,  from  the  positive  x  axis,  just  as  in  cylindrical 
coordinates,  and  also  an  angle  of  rotation,  (f> ,  from  the  positive  z  axis.  Roughly  speaking,  0  is  like  longitude 
and  (j)  is  like  latitude.  (Earth  longitude  is  measured  as  a  positive  or  negative  angle  from  the  prime  meridian, 
and  is  always  between  0  and  180  degrees,  east  or  west;  9  can  be  any  positive  or  negative  angle,  and  we 
use  radians  except  in  informal  circumstances.  Earth  latitude  is  measured  north  or  south  from  the  equator; 
(j)  is  measured  from  the  north  pole  down.)  This  system  is  called  spherical  coordinates;  the  coordinates 
are  listed  in  the  order  (p,  0, 0),  where  p  is  the  distance  from  the  origin,  and  like  r  in  polar  and  cylindrical 
coordinates  it  may  be  negative.  The  general  case  and  an  example  are  pictured  in  Figure  12.18;  the  length 
marked  r  is  the  r  of  cylindrical  coordinates. 


z  z 


Figure  12.18:  Spherical  coordinates:  the  general  case  and  the  point  with  rectangular  coordinates 

(l,v/3,3). 


As  with  cylindrical  coordinates,  we  can  easily  convert  equations  in  rectangular  coordinates  to  the 
equivalent  in  spherical  coordinates,  though  it  is  a  bit  more  difficult  to  discover  the  proper  substitutions. 
Figure  12.19  shows  the  typical  point  in  spherical  coordinates  from  Figure  12.18,  viewed  now  so  that  the 
arrow  marked  r  in  the  original  graph  appears  as  the  horizontal  “axis”  in  the  left  hand  graph.  From  this 
diagram  it  is  easy  to  see  that  the  z  coordinate  is  p  cos  (/) ,  and  that  r  —  p  sin  (j) ,  as  shown.  Thus,  in  converting 
from  rectangular  to  spherical  coordinates  we  will  replace  z  by  p  cos  (j).  To  see  the  substitutions  for  x  and  y 
we  now  view  the  same  point  from  above,  as  shown  in  the  right  hand  graph.  The  hypotenuse  of  the  triangle 
in  the  right  hand  graph  is  r  —  p  sin^,  so  the  sides  of  the  triangle,  as  shown,  are  x  —  r  cos  9  —  p  sin  (j)  cos  0 
and  y  =  rsin0  —  p  sin  (j)  sin  0.  Therefore  to  convert  from  rectangular  to  spherical  coordinates,  we  make 
these  substitutions: 


x  —  p  sin  (j)  cos  0 
y  =  p  sin  (j)  sin  0 
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Z  —  pc  OS(f). 


Figure  12.19:  Converting  from  rectangular  to  spherical  coordinates. 

As  the  cylinder  had  a  simple  equation  in  cylindrical  coordinates,  so  does  the  sphere  in  spherical  coor¬ 
dinates. 


Example  12.16: 


Find  an  equation  for  the  sphere  of  radius  2  in  spherical  coordinates. 


Solution.  If  we  start  with  the  Cartesian  equation  of  the  sphere  and  substitute,  we  get  the  spherical  equation: 

x2+y2  +z2  —  22 
p 2  sin2  (j)  cos2  0  +  p2  sin2  (j)  sin2  9  +  p2  cos2  (j)  —  2r 
p2  sin2  (j) (cos2  0  +  sin2  9)  +p2cos2(j)  =  22 
p2  sin2  (j)  +  p2  cos2  (j)  =  22 
p2(sin20  +  cos20)  =  22 
p2  =  22 

P=  2 

Therefore,  in  spherical  coordinates,  a  sphere  of  radius  2  is  expressed  p  —2.  X 

Although  not  as  simple  as  with  cylindrical  coordinates,  we  can  use  spherical  coordinates  to  describe 
the  equation  of  a  cylinder. 


Example  12.17:  Cylinder  Equation  in  Spherical  Coordinates 


Find  an  equation  for  the  cylinder x2  +  y2  =  4  in  spherical  coordinates. 


x2  +y2  =  4 


Solution.  Proceeding  as  in  the  previous  example: 
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p2  sin2  (j)  cos2  0  +  p2  sin2  (j)  sin2  0—4 

p2sin20(cos20  +  sin20)  —  4 
p2sin2(j)  =  4 


p  sin  (j)  =  2 

2 

p  = - 

sin0 


* 


Exercises  for  12.6 


Exercise  12.6.1  Convert  the  following  points  in  rectangular  coordinates  to  cylindrical  and  spherical 
coordinates: 

(a)  (1,1,1) 

(b)  (7, -7, 5) 

(c)  (cos(l),sin(l),  1) 

(d)  (0,0, -tt) 

Exercise  12.6.2  Find  an  equation  for  the  sphere  x2  +y2  +  z2  =  4  in  cylindrical  coordinates. 

Exercise  12.6.3  Find  an  equation  for  the  y-z  plane  in  cylindrical  coordinates. 

Exercise  12.6.4  Find  an  equation  equivalent  to  x1  +  y2  +-  2 z2  +  2z  —  5  =  0  in  cylindrical  coordinates. 

_  2 

Exercise  12.6.5  Suppose  the  cur\’e  z.  —  e  1  in  the  x-z  plane  is  rotated  around  the  z  axis.  Find  an  equation 
for  the  resulting  surface  in  cylindrical  coordinates. 

Exercise  12.6.6  Suppose  the  cur\’e  z—x  in  the  x-z  plane  is  rotated  around  the  z  axis.  Find  an  equation  for 
the  resulting  surface  in  cylindrical  coordinates. 

Exercise  12.6.7  Find  an  equation  for  the  plane  y  —  0  in  spherical  coordinates. 

Exercise  12.6.8  Find  an  equation  for  the  plane  z.  —  1  in  spherical  coordinates. 

Exercise  12.6.9  Find  an  equation  for  the  sphere  with  radius  1  and  center  at  (0, 1,0)  in  spherical  coordi¬ 
nates. 

Exercise  12.6.10  Find  an  equation  for  the  cylinder  x2  +  y2  =  4  in  spherical  coordinates. 
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Exercise  12.6.11  Suppose  the  cur\>e  z  —  x  in  the  x-z  plane  is  rotated  around  the  z  axis.  Find  an  equation 
for  the  resulting  surface  in  spherical  coordinates. 

Exercise  12.6.12  Plot  the  polar  equations  r  =  sin(0)  and  r  —  cos(0)  and  comment  on  their  similarities. 
(If  you  get  stuck  on  how  to  plot  these,  you  can  multiply  both  sides  of  each  equation  by  r  and  convert  back 
to  rectangular  coordinates). 

Exercise  12.6.13  Extend  Exercises  12.6.6  and  12.6.11  by  rotating  the  cur\>e  z  =  mx  around  the  z.  axis  and 
converting  to  both  cylindrical  and  spherical  coordinates. 

Exercise  12.6.14  Convert  the  spherical  formula  p  =  sin  0  sin  (j)  to  rectangular  coordinates  and  describe 
the  surface  defined  by  the  formula  (Hint:  Multiply  both  sides  by  p.) 

Exercise  12.6.15  We  can  describe  points  in  the  first  octant  by  x  >  0,  y  >  0  and  z  >  0.  Give  similar 
inequalities  for  the  first  octant  in  cylindrical  and  spherical  coordinates. 


13.  Partial  Differentiation 


13.1  Functions  of  Several  Variables 


In  single- variable  calculus  we  were  concerned  with  functions  that  map  the  real  numbers  R  to  R,  sometimes 
called  “real  functions  of  one  variable”,  meaning  the  “input”  is  a  single  real  number  and  the  “output”  is 
likewise  a  single  real  number.  Now  we  turn  to  functions  of  several  variables,  where  several  input  variables 
are  mapped  to  one  value:  functions  / :  R”  — >  R.  We  will  deal  primarily  with  n  —  2  and  to  a  lesser  extent 
—  in  fact  many  of  the  techniques  we  discuss  can  be  applied  to  larger  values  of  n  as  well. 

A  function  /:  R2  — >  R  maps  a  pair  of  values  (x,y)  to  a  single  real  number.  The  three-dimensional 
coordinate  system  we  have  already  used  is  a  convenient  way  to  visualize  such  functions:  above  each  point 
(x,y)  in  the  x-y  plane  we  graph  the  point  (x,y,z),  where  of  course  z  —  f(x,y). 


Example  13.1:  Plane 


Describe  the  function  f(x,y )  =  3x  +  4y  —  5. 


Solution.  Writing  this  as  z  =  3x  +  Ay  —  5  and  then  3x  +  4y  —  z  —  5  we  recognize  the  equation  of  a  plane.  In 
the  form  f(x,y )  =  3x  +  4y  —  5  the  emphasis  has  shifted:  we  now  think  of  x  and  y  as  independent  variables 
and  z  as  a  variable  dependent  on  them,  but  the  geometry  is  unchanged.  £ 


Solution.  We  have  seen  that  x2  +y2  +z2  =  4  represents  a  sphere  of  radius  2.  We  cannot  write  this  in  the 
form  f(x,y) ,  since  for  each  x  and  y  in  the  disk x2  +  y2  <  4  there  are  two  corresponding  points  on  the  sphere. 
As  with  the  equation  of  a  circle,  we  can  resolve  this  equation  into  two  functions,  fi(x,y)  —  y/4  —  x2  —  y2 
and  /2(x,y)  =  —  y/ 4  —  x2  —  y2,  representing  the  upper  and  lower  hemispheres,  respectively.  Each  of  these 
is  an  example  of  a  function  with  a  restricted  domain:  only  certain  values  of  x  and  y  make  sense  (namely, 
those  for  which  x2  +y2  <  4)  and  the  graphs  of  these  functions  are  limited  to  a  small  region  of  the  plane. 

* 


Example  13.3:  Square  Root 


Describe  the  function  f(x,y)  =  y/x  +  yfy. 
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Solution.  This  function  is  defined  only  when  both  x  and  y  are  non-negative.  When  y  —  0  we  get  f(x,y )  = 
y/x,  the  familiar  square  root  function  in  the  x-z  plane,  and  when  x  —  0  we  get  the  same  curve  in  the  y-z 
plane.  Generally  speaking,  we  see  that  starting  from  /( 0,0)  =  0  this  function  gets  larger  in  every  direction 
in  roughly  the  same  way  that  the  square  root  function  gets  larger.  For  example,  if  we  restrict  attention  to 
the  linex  =  y,  we  get  /(x,y)  —2y/x&nd  along  the  liney  =  2xwe  have  f(x,y)  —  y/x  +  V 2x—  (l  +  \/2)y/x. 

X 


Figure  13.1:  /(x,y)  =  y/x  +  y/y 


A  computer  program  that  plots  such  surfaces  can  be  very  useful,  as  it  is  often  difficult  to  get  a  good 
idea  of  what  they  look  like.  Still,  it  is  valuable  to  be  able  to  visualize  relatively  simple  surfaces  without 
such  aids.  As  in  the  previous  example,  it  is  often  a  good  idea  to  examine  the  function  on  restricted  subsets 
of  the  plane,  especially  lines.  It  can  also  be  useful  to  identify  those  points  (x,y)  that  share  a  common 
z- value. 


Solution.  When  x  —  0  this  becomes  f  —  y2,  a  parabola  in  the  y-z  plane;  when  y  =  0  we  get  the  “same” 
parabola  f  —  x2  in  the  x-z  plane. 

Finally,  picking  a  value  z  —  k,  at  what  points  does  /(x,y )  =  k?  This  means  x2  +y2  —  k,  which  we 
recognize  as  the  equation  of  a  circle  of  radius  \fk.  So  the  graph  of  f(x,y)  has  parabolic  cross-sections,  and 
the  same  height  everywhere  on  concentric  circles  with  center  at  the  origin.  This  fits  with  what  we  have 
already  discovered.  X 
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As  in  this  example,  the  points  (x,y)  such  that  f(x,y )  =  k  usually  form  a  curve,  called  a  level  curve 
of  the  function.  A  graph  of  some  level  curves  can  give  a  good  idea  of  the  shape  of  the  surface;  it  looks 
much  like  a  topographic  map  of  the  surface.  In  Figure  13.2  both  the  surface  and  its  associated  level  curves 
are  shown.  Note  that,  as  with  a  topographic  map,  the  heights  corresponding  to  the  level  curves  are  evenly 
spaced,  so  that  where  curves  are  closer  together  the  surface  is  steeper. 

Functions  /:  R”  — *  R  behave  much  like  functions  of  two  variables;  we  will  on  occasion  discuss  func¬ 
tions  of  three  variables.  The  principal  difficulty  with  such  functions  is  visualizing  them,  as  they  do  not 
“fit”  in  the  three  dimensions  we  are  familiar  with.  For  three  variables  there  are  various  ways  to  interpret 
functions  that  make  them  easier  to  understand.  For  example,  f(x,y,z )  could  represent  the  temperature  at 
the  point  (jt,y,z),  or  the  pressure,  or  the  strength  of  a  magnetic  field.  It  remains  useful  to  consider  those 
points  at  which  f(x,y,z)  —  k,  where  k  is  some  constant  value.  If  f(x,y,z )  is  temperature,  the  set  of  points 
(x,y,z)  such  that  f(x,y,z )  —  k  is  the  collection  of  points  in  space  with  temperature  k;  in  general  this  is 
called  a  level  set;  for  three  variables,  a  level  set  is  typically  a  surface,  called  a  level  surface. 


Example  13.5:  Level  Surfaces 


Suppose  the  temperature  at  ( x,y,z )  is  T(x,y,z )  =  e~('~+>,2+~)  This  function  has  a  maximum  value 
of  1  at  the  origin,  and  tends  to  0  in  all  directions.  If  k  is  positive  and  at  most  1,  the  set  of  points  for 
which  T(x,y,z )  =  k  is  those  points  satisfying  x2  +y2  +  z2  —  —  Ink,  a  sphere  centered  at  the  origin. 
The  level  surfaces  are  the  concentric  spheres  centered  at  the  origin. 


Exercises  for  13.1 


Exercise  13.1.1  Let  f(x,y)  —  (x  —  y)2.  Determine  the  equations  and  shapes  of  the  cross-sections  when 
x  =  0,  y  —  0,  x  —  y,  and  describe  the  level  curves.  Use  a  three-dimensional  graphing  tool  to  graph  the 
surface. 
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Exercise  13.1.2  Let  f(x,y )  =  \x\  +  |>’|.  Determine  the  equations  and  shapes  of  the  cross-sections  when 
x  —  0,  y  —  0,  x  —  y,  and  describe  the  level  cur\>es.  Use  a  three-dimensional  graphing  tool  to  graph  the 
surface. 

Exercise  13.1.3  Let  f(x,y)  =  e~  {x~ 1  sin(x2  +  y2).  Determine  the  equations  and  shapes  of  the  cross- 
sections  when  x  —  0,  y  =  Q,  x  —  y,  and  describe  the  level  curx’es.  Use  a  three-dimensional  graphing  tool 
to  graph  the  surface. 

Exercise  13.1.4  Let  f(x,y)  —  sinf.r  —  y).  Determine  the  equations  and  shapes  of  the  cross-sections  when 
x  =  0,  y  —  0,  x  —  y,  and  describe  the  level  cur\>es.  Use  a  three-dimensional  graphing  tool  to  graph  the 
surface. 

Exercise  13.1.5  Let  f(x,y)  —  (x2  —  v2)2-  Determine  the  equations  and  shapes  of  the  cross-sections  when 
x  =  0,  y  —  0,  x  —  y,  and  describe  the  level  cur\>es.  Use  a  three-dimensional  graphing  tool  to  graph  the 
surface. 

Exercise  13.1.6  Find  the  domain  of  each  of  the  following  functions  of  two  variables: 

(a)  \J 9  —  x2  +  \Jy2  —  4 

(b)  arcsin(x2  +y2  —  2) 

(c)  \J  1 6  —  x2  —  4_v2 


Exercise  13.1.7  Below  are  two  sets  of  level  cur\’es.  One  is  for  a  cone,  one  is  for  a  paraboloid.  Which  is 
which?  Explain. 


13.2  Limits  and  Continuity 


To  develop  calculus  for  functions  of  one  variable,  we  needed  to  make  sense  of  the  concept  of  a  limit, 
which  was  used  in  the  definition  of  a  continuous  function  and  the  derivative  of  a  function.  Limits  involving 
functions  of  two  variables  can  be  considerably  more  difficult  to  deal  with;  fortunately,  most  of  the  functions 
we  encounter  are  fairly  easy  to  understand. 

The  potential  difficulty  is  largely  due  to  the  fact  that  there  are  many  ways  to  “approach”  a  point  in  the 
x-y  plane.  If  we  want  to  say  that  lim  f(x,y)  —  L,  we  need  to  capture  the  idea  that  as  (jt,y)  gets  close 

(x,y  )-r(a,b) 
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to  (a,b)  then  f(x,y )  gets  close  to  L.  For  functions  of  one  variable,  fix),  there  are  only  two  ways  that  x  can 
approach  a:  from  the  left  or  right.  But  there  are  an  infinite  number  of  ways  to  approach  (a,b):  along  any 
one  of  an  infinite  number  of  straight  lines,  or  even  along  a  curved  path  in  the  x,y-plane.  We  might  hope 
that  it’s  really  not  so  bad — suppose,  for  example,  that  along  every  possible  line  through  (a,  b)  the  value  of 
f(x,y )  gets  close  to  L;  surely  this  means  that  “/(x,y)  approaches  L  as  (x,y)  approaches  ( a,b)”.  Sadly,  no. 


Solution.  When  x  =  0  or  y  =  0,  f(x,y )  is  0,  so  the  limit  of  f(x,y )  approaching  the  origin  along  either 
the  x  or  y  axis  is  0.  Moreover,  along  the  line  y  —  mx,  f(x,y )  =  m2x3/ (x2  +  m4x4).  As  x  approaches  0 
this  expression  approaches  0  as  well.  So  along  every  line  through  the  origin  f(x,y)  approaches  0.  Now 
suppose  we  approach  the  origin  along  x  —  y2.  Then 

y2y2  _  /  _  1 

7+7  "  W  ~  * 


f(x,y) 


462  Partial  Differentiation 


so  the  limit  is  1/2.  Looking  at  Figure  13.3,  it  is  apparent  that  there  is  a  ridge  above  x  —  y2.  Approaching 
the  origin  along  a  straight  line,  we  go  over  the  ridge  and  then  drop  down  toward  0,  but  approaching  along 
the  ridge  the  height  is  a  constant  1/2.  X 

Fortunately,  we  can  define  the  concept  of  limit  without  needing  to  specify  how  a  particular  point  is 
approached — indeed,  in  Definition  3.3,  we  didn't  need  the  concept  of  “approach.”  Roughly,  that  definition 
says  that  when  x  is  close  to  a  then  /(x)  is  close  to  L;  there  is  no  mention  of  “how”  we  get  close  to  a.  We 
can  adapt  that  definition  to  two  variables  quite  easily: 


This  says  that  we  can  make  |/(x,y)  —L\  <  £,  no  matter  how  small  £  is,  by  making  the  distance  from 
(x,y)  to  ( a,b )  “small  enough”. 


Solution.  Suppose  e  >  0.  Then 


3x2y 


x1  +  y2 


xz+y - 


■3M. 


Note  thatx2/ (x2  +y2)  <  1  and  \y\  —  yjy2  <  \J x2  +y2  <  8.  So 

t2 

- y3|y|  <  1  -3-  8. 

x2+y2  11 

We  want  to  force  this  to  be  less  than  £  by  picking  8  “small  enough.”  If  we  choose  8  —  e/3  then 


3x2y 


x2  +y2 


<  1  ‘3  ■  -  =  £. 


Recall  that  a  function  /(x)  is  continuous  at  x  =  a  if  lim/(x)  =  /(a).  We  can  say  exactly  the  same 

x — Va 

thing  about  a  function  of  two  variables:  /(x,y)  is  continuous  at  ( a,b )  if  lim  f{x,y)  —  f(a,b). 

(x,y  )^{a,b) 

The  function  f(x,y )  =  3 x2y/ (x2  +y2)  is  not  continuous  at  (0,0),  because  /(0,0)  is  not  defined.  How¬ 
ever,  we  know  that  lim  f(x,y )  =  0,  so  we  can  make  a  continuous  function,  by  extending  the  definition 

OcoO-Ko.o) 

of  /  so  that  /( 0,0)  =  0.  This  surface  is  shown  in  Figure  13.4. 
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Figure  13.4:  f(x,y)  =  .  2 

x “  +yz 

Note  that  we  cannot  extend  the  definition  of  the  function  in  Example  13.6  to  create  a  continuous 
function,  since  the  limit  does  not  exist  as  we  approach  (0,0). 

Fortunately,  the  functions  we  will  be  working  with  will  usually  be  continuous  almost  everywhere. 
As  with  single  variable  functions,  two  classes  of  common  functions  are  particularly  useful  and  easy  to 
describe.  A  polynomial  in  two  variables  is  a  sum  of  terms  of  the  form  axinyn,  where  a  is  a  real  number  and 
m  and  n  are  non-negative  integers.  A  rational  function  is  a  quotient  of  polynomials. 


Theorem  13.9:  Continuity  of  Functions 


Polynomials  are  continuous  everywhere.  Rational  functions  are  continuous  everywhere  they  are 
defined. 


Exercises  for  13.2 


Determine  whether  each  limit  exists.  If  it  does,  find  the  limit  and  prove  that  it  is  the  limit;  if  it  does  not, 
explain  how  you  know. 


Exercise  13.2.1  lim  ~ 

(*o0->(o,o)  *2  +y2 


Exercise  13.2.2  lim  '  -  0 

(x,y)^(0,0  )X-+y2 


Exercise  13.2.3  lim  — ~ 

(*oO-KO,o)  2x-  +yz 
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Exercise  13.2.4  lim 


4  4 

x *  -y 


Exercise  13.2.5  lim 


x,y)^>-{0,0)  x2  +y2 

sin(x2  +  y2) 


Exercise  13.2.6  lim 


(x,y)^(0,0)  x2  +  y2 
xy 


Exercise  13.2.7  lim 


wMW)  ■>/  2x2  +  y2 
e-*-?  -  1 


■2  i  -,,2 


(*o0->(o,o)  x*+y 


Exercise  13.2.8  lim 


a  a 

*  +  v 


x,y)— >-(0,0)  X  +y2 


Exercise  13.2.9  lim 


x2  +  sin2  y 


Exercise  13.2.10  lim 


0,0)  2.r2  +y2 

(x  —  l)2lnx 


(xjHO.o)  (x-  1  )2+y2 


Exercise  13.2.11  lim  3x  +  4y 

(x^H(i-t) 


Exercise  13.2.12 


4x2y 
lim  - 

(x,y)^(0,0)  X-  +y 


2 


Exercise  13.2.13  Does  the  function  f(x,y ) 


x-y 
1  +x2+y 


t  ?  Explain. 


—  y 

- - —  have  any  discontinuities?  What  about  f(x,y )  — 

l+x+y  '  v  ’ 


13.3  Partial  Differentiation 


The  derivative  of  a  function  of  a  single  variable  tells  us  how  quickly  the  value  of  the  function  changes  as 
the  value  of  the  independent  variable  changes.  Intuitively,  it  tells  us  how  “steep”  the  graph  of  the  function 
is.  We  might  wonder  if  there  is  a  similar  idea  for  graphs  of  functions  of  two  variables,  that  is,  surfaces.  It 
is  not  clear  that  this  has  a  simple  answer,  nor  how  we  might  proceed.  We  will  start  with  what  seem  to  be 
very  small  steps  toward  the  goal.  Surprisingly,  it  turns  out  that  these  simple  ideas  hold  the  keys  to  a  more 
general  understanding. 
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Figure  13.5:  The  intersection  of  a  plane  x+y  —  1  and  the  surface  f(x,y )  —  x2+y2 


The  derivative  of  a  single-variable  function  f(x)  tells  us  how  much  f(x)  changes  as  x  increases.  The 
obvious  analogue  for  a  function  of  two  variables  g(x,y)  would  be  something  that  tells  us  how  quickly 
g(x,y)  increases  as  x  and  y  increase.  However,  in  most  cases  this  will  depend  on  how  quickly  x  and  y  are 
changing  relative  to  each  other. 


Solution.  If  we  look  at  a  point  (x,y,y2)  on  this  surface,  the  value  of  a  function  does  not  change  at  all  if  we 
fix  y  and  let  x  increase,  but  increases  like  y2  if  we  fix  x  and  let  y  increase.  X 

Now  let  us  consider  what  happens  to  /(x,y)  when  both  x  and  y  are  increasing,  perhaps  at  different 
rates.  We  can  think  of  this  as  being  a  movement  in  a  certain  direction  of  a  point  in  the  x,y-plane.  A  point 
and  a  direction  defines  a  line  in  the  x,y-plane,  and  so  we  are  asking  how  the  function  changes  as  we  move 
along  this  line. 

Let  us  then  imagine  a  plane  perpendicular  to  the  x,y-plane  that  intersects  the  the  x,y-plane  along  this 
line.  This  plane  will  intersect  the  surface  of  /  in  a  curve,  so  we  can  just  look  at  the  behaviour  of  this  curve 
in  the  given  plane. 

Figure  13.5  shows  the  plane  x  +  y  —  1,  which  is  the  plane  perpendicular  to  the  line  x  +  y  —  1  in  the 
x,y-plane.  Observe  that  its  intersection  with  the  surface  of  /  is  a  curve,  in  fact,  a  parabola.  We  will  refer 
to  such  a  curve  as  the  cross-section  of  the  surface  above  the  line  in  the  x,  y-plane. 

We  can  now  look  at  the  rate  of  change  (or  slope)  of  /  in  a  particular  direction  by  looking  at  the  slope 
of  a  curve  in  a  plane  —  something  we  already  have  experience  with. 

Let’s  start  by  looking  at  some  particularly  easy  lines:  Those  parallel  to  the  x  or  y  axis.  Suppose  we 
are  interested  in  the  cross-section  of  /(x,y)  above  the  line  y  =  b.  If  we  substitute  b  for  y  in  /(x,y),  we  get 
a  function  in  one  variable,  describing  the  height  of  the  cross-section  as  a  function  of  x.  Because  y  —  b  is 
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parallel  to  the  x-axis,  if  we  view  it  from  a  vantage  point  on  the  negative  y-axis,  we  will  see  what  appears 
to  be  simply  an  ordinary  curve  in  the  x-z  plane. 


Figure  13.6:  f(x,y)  —  x2  +y2,  cut  by  the  plane  y  —  2 


Consider  again  the  parabolic  surface  f(x,y )  =  x2  +  y2.  The  cross-section  above  the  line  y  —  2  consists 
of  all  points  (x,2,x2  +4).  Looking  at  this  cross-section  we  see  what  appears  to  be  just  the  curve  /(x)  = 
x2  +4.  At  any  point  on  the  cross-section,  (a,  2,  a2  +  4),  the  slope  of  the  surface  in  the  direction  of  the  line 
y  —  2  is  simply  the  slope  of  the  curve  /(x)  =  x2  +  4,  namely  2x.  Figure  13.6  shows  the  same  parabolic 
surface  as  before,  but  now  cut  by  the  plane  y  =  2.  The  left  graph  shows  the  cut-off  surface,  the  right  shows 
just  the  cross-section. 

If,  for  example,  we’re  interested  in  the  point  (  —  1,2,5)  on  the  surface,  then  the  slope  in  the  direction 
of  the  line  y  =  2  is  2x  =  2(  —  1)  =  —  2.  This  means  that  starting  at  (  —  1,2,5)  and  moving  on  the  surface, 
above  the  line  y  —  2,  in  the  direction  of  increasing  x  values,  the  surface  goes  down;  of  course  moving  in 
the  opposite  direction,  toward  decreasing  x  values,  the  surface  will  rise. 

If  we’re  interested  in  some  other  line  y  —  k,  there  is  really  no  change  in  the  computation.  The  equation 
of  the  cross-section  above  y  —  k  is  x2  +  k2  with  derivative  2x.  We  can  save  ourselves  the  effort,  small  as  it 
is,  of  substituting  k  for  y:  all  we  are  in  effect  doing  is  temporarily  assuming  that  y  is  some  constant.  With 
this  assumption,  the  derivative  ^(x2  +  y2)  =  2x.  To  emphasize  that  we  are  only  temporarily  assuming 
y  is  constant,  we  use  a  slightly  different  notation:  J^(x2  +  y2)  =  2x;  the  “<9”  reminds  us  that  there  are 
more  variables  than  x,  but  that  only  x  is  being  treated  as  a  variable.  We  read  the  equation  as  “the  partial 
derivative  of  (x2  -t-y2)  with  respect  to  x  is  2x.”  A  convenient  alternate  notation  for  the  partial  derivative  of 
/(x,y)  with  respect  to  x  is  is  fx(x,y). 


Example  13.11:  Partial  Derivative  with  respect  to  x 


Find  the  partial  derivative  with  respect  tox  ofx 3  +  3xy. 


Solution.  The  partial  derivative  with  respect  to  x  of  x3  +  3xy  is  3x2  +  3 y.  Note  that  the  partial  derivative 
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includes  the  variable  y,  unlike  the  example  x2  +  y2.  It  is  somewhat  unusual  for  the  partial  derivative  to 
depend  on  a  single  variable;  this  example  is  more  typical.  & 

Of  course,  we  can  do  the  same  sort  of  calculation  for  lines  parallel  to  the  y-axis.  We  temporarily  hold 
x  constant,  which  gives  us  the  equation  of  the  cross-section  above  a  line  x  —  k.  We  can  then  compute  the 
derivative  with  respect  to  y;  this  will  measure  the  slope  of  the  curve  in  the  y  direction. 


Example  13.12:  Partial  Derivative  with  respect  to  y 


Find  the  partial  derivative  with  respect  to  y  of  f(x,y)  =  sin(xy)  +  3xy. 


Solution  The  partial  derivative  with  respect  to  y  of  f(x,y )  =  sin(xy)  +  3xy  is 

fy(x,y )  =  sin(vy)  +  3  xy  =  cos(vy)  (xy)  +  3x  —  vcos(xy)  +  3x. 


So  far,  using  no  new  techniques,  we  have  succeeded  in  measuring  the  slope  of  a  surface  in  two  quite 
special  directions.  For  functions  of  one  variable,  the  derivative  is  closely  linked  to  the  notion  of  tangent 
line.  For  surfaces,  the  analogous  idea  is  the  tangent  plane — a  plane  that  just  touches  a  surface  at  a  point, 
and  has  the  same  slope  as  the  surface  in  all  directions.  Even  though  we  haven’t  yet  figured  out  how  to 
compute  the  slope  in  all  directions,  we  have  enough  information  to  find  tangent  planes.  Suppose  we  want 
the  plane  tangent  to  a  surface  at  a  particular  point  ( a,b,c ).  If  we  compute  the  two  partial  derivatives  of 
the  function  for  that  point,  we  get  enough  information  to  determine  two  lines  tangent  to  the  surface,  both 
through  ( a,b,c )  and  both  tangent  to  the  surface  in  their  respective  directions.  These  two  lines  determine 
a  plane,  that  is,  there  is  exactly  one  plane  containing  the  two  lines:  the  tangent  plane.  Figure  13.7  shows 
(part  of)  two  tangent  lines  at  a  point,  and  the  tangent  plane  containing  them. 


Figure  13.7:  Tangent  vectors  and  tangent  plane. 

How  can  we  discover  an  equation  for  this  tangent  plane?  We  know  a  point  on  the  plane,  ( a,b,c );  we 
need  a  vector  normal  to  the  plane.  If  we  can  find  two  vectors,  one  parallel  to  each  of  the  tangent  lines  we 
know  how  to  find,  then  the  cross  product  of  these  vectors  will  give  the  desired  normal  vector. 
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z 


Figure  13.8:  A  tangent  vector. 


How  can  we  find  vectors  parallel  to  the  tangent  lines?  Consider  first  the  line  tangent  to  the  surface 
above  the  line  y  =  b.  A  vector  ( u,v,w )  parallel  to  this  tangent  line  must  have  y  component  v  =  0,  and  we 
may  as  well  take  the  x  component  to  be  u  —  1 .  The  ratio  of  the  z  component  to  the  x  component  is  the  slope 
of  the  tangent  line,  precisely  what  we  know  how  to  compute.  The  slope  of  the  tangent  line  is  fx(a,b),  so 


In  other  words,  a  vector  parallel  to  this  tangent  line  is  (1,0 ,fx(a,b)),  as  shown  in  Figure  13.8.  If  we  repeat 
the  reasoning  for  the  tangent  line  above  x  —  a,  we  get  the  vector  (0, 1  ,fy(a,b)). 

Now  to  find  the  desired  normal  vector  we  compute  the  cross  product,  (0,l,fy)x(l,0,fx)  —  (fx,fy,  —  l). 
From  our  earlier  discussion  of  planes,  we  can  write  down  the  equation  we  seek:  fx{a,b)x  +  fy(a,b)y  —  z  = 
k,  and  k  as  usual  can  be  computed  by  substituting  a  known  point:  fx{a,b)(a)  +  fy(a,b)(b)  —  c  —  k.  There 
are  various  more-or-less  nice  ways  to  write  the  result: 

fx(a,  b)x  +  fy  ( a ,  b)y~z  =  fx{a,  b)a  +  fy  (a,  b)b-c 
fx{a, b)x  +  fy ( a ,  b)y  -  fx(a, b)a  -  fy(a,b)b  +  c  =  z 
fx(a,b)(x-a)+fy(a,b)(y-b)  +  c  =  z 
fx{a,  b )  (x-a)+  fy(a,  b )  (y  ~b)+  f(a,  b)  =  z 


Solution.  The  point  (1,1, a/2)  is  on  the  upper  hemisphere,  so  we  use  f(x,y )  =  \/ 4  — x2— y2.  Then 
fx(x,y)  =  —x(4  —  x2  —  y2)  1//2  and  fy(x,y)  =  -y(4 -x2  -y2)_1/2,  so  /*(1,1)  =  fy(  1,1)  =  -\/>/2  and 
the  equation  of  the  plane  is 


z 


1 

71 


C*-1) 


-±{y-1)  +  V2. 


The  hemisphere  and  this  tangent  plane  are  pictured  in  Figure  13.7. 


X 
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So  it  appears  that  to  find  a  tangent  plane,  we  need  only  find  two  quite  simple  ordinary  derivatives, 
namely  fx  and  fy.  This  is  true  if  the  tangent  plane  exists.  It  is,  unfortunately,  not  always  the  case  that  if 
fx  and  fy  exist  there  is  a  tangent  plane.  Consider  the  function  xy2 / (x2  +y4)  with  /(0,0)  defined  to  be  0, 
pictured  in  Figure  13.3.  This  function  has  value  0  when  x  —  0  or  y  —  0.  Now  it’s  clear  that  fx( 0,0)  = 
fy (0, 0)  =  0,  because  in  the  x  and  y  directions  the  surface  is  simply  a  horizontal  line.  But  it’s  also  clear 
from  the  picture  that  this  surface  does  not  have  anything  that  deserves  to  be  called  a  tangent  plane  at  the 
origin,  certainly  not  the  x-y  plane  containing  these  two  tangent  lines. 

When  does  a  surface  have  a  tangent  plane  at  a  particular  point?  What  we  really  want  from  a  tangent 
plane,  as  from  a  tangent  line,  is  that  the  plane  be  a  “good”  approximation  of  the  surface  near  the  point. 
Here  is  how  we  can  make  this  precise: 


This  definition  takes  a  bit  of  absorbing.  Let’s  rewrite  the  central  equation  a  bit: 

z  =  fx(xo,yo)(x  -  xq)  +  fy(xo,yo)(y  -  yo)  +  f(xo,yo)  +  £\Ax  +  e2Ay.  (13.1) 

The  first  three  terms  on  the  right  are  the  equation  of  the  tangent  plane,  that  is, 

fx(xo,yo)(x-x0)+fy(xo,yo)(y-yo)+f(xo,y0) 

is  the  z- value  of  the  point  on  the  plane  above  (x,y).  Equation  13.1  says  that  the  z- value  of  a  point  on  the 
surface  is  equal  to  the  z-value  of  a  point  on  the  plane  plus  a  “little  bit,”  namely  £\Ax  +  £2 Ay.  As  (x,y) 
approaches  (xo,yo),  both  Ax  and  Ay  approach  0,  so  this  little  bit  £j  Ax  +  £2 Ay  also  approaches  0,  and  the 
z-values  on  the  surface  and  the  plane  get  close  to  each  other.  But  that  by  itself  is  not  very  interesting: 
since  the  surface  and  the  plane  both  contain  the  point  (xo,yo,zo),  the  z  values  will  approach  zo  and  hence 
get  close  to  each  other  whether  the  tangent  plane  is  “tangent”  to  the  surface  or  not.  The  extra  condition  in 
the  definition  says  that  as  (x,y)  approaches  (xo,yo),  the  £  values  approach  0 — this  means  that  £iAx  +  £2Ay 
approaches  0  much,  much  faster,  because  £iAx  is  much  smaller  than  either  £\  or  Av.  It  is  this  extra 
condition  that  makes  the  plane  a  tangent  plane. 

We  can  see  that  the  extra  condition  on  £1  and  £2  is  just  what  is  needed  if  we  look  at  partial  derivatives. 
Suppose  we  temporarily  fix  y  =  yo,  so  Ay  =  0.  Then  the  equation  from  the  definition  becomes 

Az  =  /v(xo,yo)Ax  +  £1  Ax 


or 

~^x=  fx(xo,yo)+£i- 

Now  taking  the  limit  of  the  two  sides  as  Ax  approaches  0,  the  left  side  turns  into  the  partial  derivative  of  z 
with  respect  to  x  at  (xo,yo),  or  in  other  words  fx{xo,yo),  and  the  right  side  does  the  same,  because  as  (x,y) 
approaches  (xo,yo),  £1  approaches  0.  Essentially  the  same  calculation  works  for  fy. 
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Exercises  for  13.3 


Exercise  13.3.1 
Exercise  13.3.2 

Exercise  13.3.3 
Exercise  13.3.4 
Exercise  13.3.5 
Exercise  13.3.6 


Find  fx  and  fy  where  f(x,y ) 

Find  fx  and  fy  where  f(x,y ) 

Find  fx  and  fy  where  f(x,y) 

Find  fx  and  fy  where  f(x,y)  —  xyln(xy). 

Find  fx  and  fy  where  f(x,y)  —  \J  1  —  A'2  —y2. 
Find  fx  and  fy  where  f(x,y )  =  xtan(y). 


=  cos(x2v)  +  y3. 

=  xy 
x2+y' 

=  ex2+y2. 


Exercise  13.3.7  Find  fx  and  fy  where  f(x,y) 


1 

xy‘ 


Exercise  13.3.8  Find  an  equation  for  the  plane  tangent  to  2x2  +  3  v2  —  z2  —4  at  (1, 1,-1). 

Exercise  13.3.9  Find  an  equation  for  the  plane  tangent  to  f(x,y )  =  sin(xy)  at  (Jt,  1/2, 1). 

Exercise  13.3.10  Find  an  equation  for  the  plane  tangent  to  f{x,y)  —  x2  +  y3  at  (3, 1, 10). 

Exercise  13.3.11  Find  an  equation  for  the  plane  tangent  to  f(x,y)  —  x\n  (xy)  at  (2, 1/2,0). 

Exercise  13.3.12  Find  an  equation  for  the  line  normal  to  x2  +  4y2  =  2  z  at  (2, 1,4). 

Exercise  13.3.13  Explain  in  your  own  words  why,  when  taking  a  partial  derivative  of  a  function  of  multiple 
variables,  we  can  treat  the  variables  not  being  differentiated  as  constants. 


Exercise  13.3.14  Consider  a  differentiable  function,  f(x,y).  Give  physical  interpretations  of  the  meanings 
offx(a,b)  and  fy(a,b)  as  they  relate  to  the  graph  of  f. 

Exercise  13.3.15  In  much  the  same  way  that  we  used  the  tangent  line  to  approximate  the  value  of  a 
function  from  single  variable  calculus,  we  can  use  the  tangent  plane  to  approximate  a  function  from  mul¬ 
tivariable  calculus.  Consider  the  tangent  plane  found  in  Exercise  13.3.11.  Use  this  plane  to  approximate 
/( 1.98,0.4). 

Exercise  13.3.16  The  volume  of  a  cylinder  is  given  by  V  —  nr2h.  Suppose  that  the  current  values  ofr  and 
h  are  r  —  1  cm  and  h  =  3  cm.  Is  the  volume  more  sensitive  to  a  small  change  in  radius  or  the  same  amount 
of  change  in  height?  Why? 
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Exercise  13.3.17  Suppose  that  one  of  your  colleagues  has  calculated  the  partial  derivatives  of  a  given 
function,  and  reported  to  you  that  fx{x,y )  =  2x  +  3y  and  that  fy(x,y )  =  4x  +  6y.  Do  you  believe  them? 
Why  or  why  not?  If  not,  what  answer  might  you  have  accepted  for  fy  ? 

Exercise  13.3.18  Suppose  f(t )  and  g(t)  are  single  variable  differentiable  functions.  Find  dz/dx  and 
dz/dy  for  each  of  the  following  two  variable  functions. 

(a)  z  =  f{x)g{y) 

(b)  z  =  f(xy) 

(c)  z  =  f{x/y) 


13.4  The  Chain  Rule 


Consider  the  surface  z  —  x2y  +  xy2,  and  suppose  that  x  —  2  +  t4  and  y  =  1  —  t3.  We  can  think  of  the  latter 
two  equations  as  describing  how  x  and  y  change  relative  to,  say,  time.  Then 

z  =  x2y  +  xy2  =  (2  + t4)2  ( 1  - 13)  +  (2  + 14)  ( 1  - t3)2 

tells  us  explicitly  how  the  z  coordinate  of  the  corresponding  point  on  the  surface  depends  on  t.  If  we  want 
to  know  dz/dt  we  can  compute  it  more  or  less  directly,  but  it’s  actually  a  bit  simpler  to  use  product  and 
chain  rules: 


— -  =  x2y'  +  2 xx'y  +  x2yy  +  xy2 
dt 

=  (2 xy+y2)x  +  (x2  +  2 xy)y 

—  (2(2  +  f4)(l  —t3)  +  (1  —t3)2)(4t3)  +  ((2  +  t4)2  +  2(2  +  t4)(l  — 13))(— 3t2) 


If  we  look  carefully  at  the  middle  step,  dz/dt  —  (2xy  +  y2)x'  +  (x2  +  2xy)y' ,  we  notice  that  2xv  +  y2  is 
dz/ dx,  and  x2  +  2xy  is  dzf  dy.  This  turns  out  to  be  true  in  general,  and  gives  us  a  new  chain  rule: 


Proof.  If  /  is  differentiable,  then 


Az  =  A(x0,yo)Ax  +  /y(xo,yo)Ay  +  eiAx  +  e2Ay, 
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where  £]  and  £2  approach  0  as  (x,  y)  approaches  (xo,yo)-  Then 


Az 

At 


„  Ax  Ay  Ax  Ay 

f^+f!'At+£lyt+e2yt- 


As  At  approaches  0,  (x,y)  approaches  (xo,yo)  and  so 


Az  dz 
lim  —  =  — 
Af— >0  At  dt 


lim  £1 

Af— >0 


Ax 

At 


0- 


dx 

dt 


v 

lim  £2  — 
Af— >0  At 


0- 


dy 

dt 


(13.2) 


and  so  taking  the  limit  of  (13.2)  as  At  goes  to  0  gives 


dz 

dt 


hdthdt' 


as  desired.  £ 

We  can  write  the  chain  rule  in  a  way  that  is  somewhat  closer  to  the  single  variable  chain  rule: 

^  =  (fx,fy)  ■  (x',y'), 

or  (roughly)  the  derivatives  of  the  outside  function  “times”  the  derivatives  of  the  inside  functions.  Not 
surprisingly,  essentially  the  same  chain  rule  works  for  functions  of  more  than  two  variables,  for  example, 
given  a  function  of  three  variables  f(x,y,z),  where  each  of  x,  y  and  z  is  a  function  of  t, 

d£  =  (fxJyJz)-(^,y',z). 

We  can  even  extend  the  idea  further.  Suppose  that  f(x,y )  is  a  function  and  x  =  g(s,t)  and  y  =  h(s,t) 
are  functions  of  two  variables  s  and  t.  Then  /  is  “really”  a  function  of  s  and  t  as  well,  and 

df  df 

=  fxgs  +  fyhs  -fa  =  fxgt  +  fyht . 

The  natural  extension  of  this  to  f(x,y,z )  works  as  well. 

Recall  that  we  used  the  ordinary  chain  rule  to  do  implicit  differentiation.  We  can  do  the  same  with  the 
new  chain  rule. 


Solution.  The  equation  x2  +  y2  +  z2  —  4  defines  a  sphere,  which  is  not  a  function  of  x  and  y,  though  it  can 
be  thought  of  as  two  functions,  the  top  and  bottom  hemispheres.  We  can  think  of  z  as  one  of  these  two 
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functions,  so  really  z  —  z(x,y),  and  we  can  think  of  x  and  y  as  particularly  simple  functions  of  x  and  y,  and 
let  f(x,y,z )  =  x2  +y2  +  z2.  Since  f(x,y,z )  =  4,  df/dx  =  0,  but  using  the  chain  rule: 


0 


df 

dx 


+  fz 


dz 

dx 


(2*)(l)  +  (2y)(0)  +  (2J)|? 


noting  that  since  y  is  temporarily  held  constant  its  derivative  By/ dx  =  0.  Now  we  can  solve  for  dz/ dx: 

dz  2x  x 

dx  2  z  z' 

In  a  similar  manner  we  can  compute  dz/ dy.  X 


Exercises  for  13.4 


Exercise  13.4.1  Use  the  chain  rule  to  compute  dz/dt  for  z  —  sin(x2  +  y2),  x  —  t2  +  3,  y  —  f\ 

Exercise  13.4.2  Use  the  chain  rule  to  compute  dz/dt  for  z  —  x2y,  x  —  sin(f),  y  =  t2+  1. 

Exercise  13.4.3  Use  the  chain  rule  to  compute  dz/ ds  and  dz/ dt  for  z  —  x2y,  x  —  sin(.v/),  y  —  t2  +  s2. 

Exercise  13.4.4  Use  the  chain  rule  to  compute  dz/ ds  and  dz/ dt  for  z  —  x2y2,  x  —  st,  y  —  t2  —  s2. 

Exercise  13.4.5  Use  the  chain  rule  to  compute  dz/ dx  and  dz/ dyfor  2x2  +  3 y2  —  2 z2  =  9. 

Exercise  13.4.6  Use  the  chain  rule  to  compute  dz/ dx  and  dz/ dyfor  2x2  +  y2  +  z2  =  9. 

Exercise  13.4.7  Chemistry  students  will  recognize  the  ideal  gas  law,  given  by  PV  —  nRT  which  relates  the 
Pressure,  Volume,  and  Temperature  of  n  moles  of  gas.  (R  is  the  ideal  gas  constant).  Thus,  we  can  view 
pressure,  volume,  and  temperature  as  variables,  each  one  dependent  on  the  other  two. 

(a)  If  pressure  of  a  gas  is  increasing  at  a  rate  of0.2Pa/min  and  temperature  is  increasing  at  a  rate  of 
\K/min,  how  fast  is  the  volume  changing? 

(b)  If  the  volume  of  a  gas  is  decreasing  at  a  rate  of0.3L/min  and  temperature  is  increasing  at  a  rate  of 
.5K/min,  how  fast  is  the  pressure  changing? 

(c)  If  the  pressure  of  a  gas  is  decreasing  at  a  rate  ofOAPa/min  and  the  volume  is  increasing  at  a  rate 
of  3  L/ min,  how  fast  is  the  temperature  changing? 
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Exercise  13.4.8  Verify  the  following  identity  in  the  case  of  the  ideal  gas  law: 

dPdVdT  _  j 
dVdTdP 


Exercise  13.4.9  The  previous  exercise  was  a  special  case  of  the  following  fact,  which  you  are  to  verify 
here:  If  F (x, y,  z.)  is  a  function  of  3  variables,  and  the  relation  F(x,y,z )  =  0  defines  each  of  the  variables 
in  terms  of  the  other  two,  namely  x  =  f(y,z),  V  —  g(x,z )  and  z  —  h(x,y),  then 

dx  dy  dz  _ 
dy  dz  dx 


13.5  Directional  Derivatives 


We  still  have  not  answered  one  of  our  first  questions  about  the  steepness  of  a  surface:  starting  at  a  point 
on  a  surface  given  by  f(x,y),  and  walking  in  a  particular  direction,  how  steep  is  the  surface?  We  are  now 
ready  to  answer  the  question. 

We  already  know  roughly  what  has  to  be  done:  as  shown  in  Figure  13.5,  we  extend  a  line  in  the  x-y 
plane  to  a  vertical  plane,  and  we  then  compute  the  slope  of  the  curve  that  is  the  cross-section  of  the  surface 
in  that  plane.  The  major  stumbling  block  is  that  what  appears  in  this  plane  to  be  the  horizontal  axis,  namely 
the  line  in  the  x-y  plane,  is  not  an  actual  axis — we  know  nothing  about  the  “units”  along  the  axis.  Our  goal 
is  to  make  this  line  into  a  t  axis;  then  we  need  formulas  to  write  x  and  y  in  terms  of  this  new  variable  t\ 
then  we  can  write  z  in  terms  of  t  since  we  know  z  in  terms  of  x  and  y;  and  finally  we  can  simply  take  the 
derivative. 

So  we  need  to  somehow  “mark  off”  units  on  the  line,  and  we  need  a  convenient  way  to  refer  to  the 
line  in  calculations.  It  turns  out  that  we  can  accomplish  both  by  using  the  vector  form  of  a  line.  Suppose 
that  u  is  a  unit  vector  (wj,  M2)  in  the  direction  of  interest.  A  vector  equation  for  the  line  through  (xoAo)  in 
this  direction  is  x(t)  —  (up+x o,  U2t  +yo)-  The  height  of  the  surface  above  the  point  (uit+x o,  U2t  +yo)  is 
g(t)  —  f(u\t+xo,U2t+yo)-  Because  u  is  a  unit  vector,  the  value  of  t  is  precisely  the  distance  along  the 
line  from  (xo,Vo)  to  {u\t  +xo,U2t  +  yo);  this  means  that  the  line  is  effectively  a  t  axis,  with  origin  at  the 
point  (xo,yo)> so  the  slope  we  seek  is 

g'{ 0)  =  (fx(xo,yo),  fy(xo,yo))  ■  (uhu2) 

=  ifxjy)'  U 

=  V/-U 


Here  we  have  used  the  chain  rule  and  the  derivatives  ^ (uit+x o)  =  u\  and  fill'll  +yo)  =  «2-  The  vector 
(fx,fy)  is  very  useful,  so  it  has  its  own  symbol,  V/,  pronounced  “del  f”;  it  is  also  called  the  gradient  of  /. 
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Solution.  We  first  compute  the  gradient  at  (1,2):  V/  =  (2x,2y),  which  is  (2,4)  at  (1,2).  A  unit  vector  in 
the  desired  direction  is  (3/5, 4/5),  and  the  desired  slope  is  then  (2,4)  •  (3/5, 4/5)  —  6/5  H-  16/5  =  22/5. 

* 


Example  13.18:  Tangent  Vector 


Find  a  tangent  vector  to  z  —  x2  +y2  at  ( 1,2)  in  the  direction  of  the  vector  (3,4)  and  show  that  it  is 
parallel  to  the  tangent  plane  at  that  point. 


Solution.  Since  (3/5, 4/5)  is  a  unit  vector  in  the  desired  direction,  we  can  easily  expand  it  to  a  tangent 
vector  simply  by  adding  the  third  coordinate  computed  in  the  previous  example:  (3/5,4/5,22/5).  To  see 
that  this  vector  is  parallel  to  the  tangent  plane,  we  can  compute  its  dot  product  with  a  normal  to  the  plane. 
We  know  that  a  normal  to  the  tangent  plane  is 


(/*(l>2),/y(l,2),  —  1)  =  (2,4,  — 1), 


and  the  dot  product  is  (2,4, —  1)  •  (3/5,4/5,22/5)  =  6/5  +  16/5  —  22/5  =  0,  so  the  two  vectors  are  per¬ 
pendicular.  (Note  that  the  vector  normal  to  the  surface,  namely  (fx,fy,  —  l),  is  simply  the  gradient  with  a 
—  1  tacked  on  as  the  third  component.)  X 

The  slope  of  a  surface  given  by  z  —  f(x,y )  in  the  direction  of  a  (two-dimensional)  vector  u  is  called  the 
directional  derivative  of  /,  written  Duf.  The  directional  derivative  immediately  provides  us  with  some 
additional  information.  We  know  that 

Duf  =  V/  ■  u  =  |  V/|  |u|  cos  0  —  |V/|  cos  9 

if  u  is  a  unit  vector;  9  is  the  angle  between  V/  and  u.  This  tells  us  immediately  that  the  largest  value  of 
Duf  occurs  when  cos  0  =  1,  namely,  when  0  =  0,  so  V/  is  parallel  to  u.  In  other  words,  the  gradient  V/ 
points  in  the  direction  of  steepest  ascent  of  the  surface,  and  |V/|  is  the  slope  in  that  direction.  Likewise, 
the  smallest  value  of  Duf  occurs  when  cos  0  =  —  1,  namely,  when  9  —  K.  so  V/  is  anti-parallel  to  u.  In 
other  words,  —V/  points  in  the  direction  of  steepest  descent  of  the  surface,  and  —  |V/|  is  the  slope  in  that 
direction. 


Example  13.19:  Direction  of  Steepest  Ascent  and  Descent 


Investigate  the  direction  of  steepest  ascent  and  descent  for  z  =  x2  +  y2 . 


Solution.  The  gradient  is  (2x,  2y)  —  2  (x,y);  this  is  a  vector  parallel  to  the  vector  (x,y),  so  the  direction 
of  steepest  ascent  is  directly  away  from  the  origin,  starting  at  the  point  (x,y).  The  direction  of  steepest 
descent  is  thus  directly  toward  the  origin  from  (x,y).  Note  that  at  (0,0)  the  gradient  vector  is  (0,0),  which 
has  no  direction,  and  it  is  clear  from  the  plot  of  this  surface  that  there  is  a  minimum  point  at  the  origin, 
and  tangent  vectors  in  all  directions  are  parallel  to  the  x-y  plane.  X 
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If  V/  is  perpendicular  to  u,  Duf  —  | V/|  cos(7t/2)  =  0,  since  cos(7t/2)  =  0.  This  means  that  in  either 
of  the  two  directions  perpendicular  to  V/,  the  slope  of  the  surface  is  0;  this  implies  that  a  vector  in  either 
of  these  directions  is  tangent  to  the  level  curve  at  that  point.  Starting  with  V/  =  (fx,fy),  it  is  easy  to  find 
a  vector  perpendicular  to  it:  either  ( fy ,  —fx)  or  {—fy,fx)  will  work. 

If  f(x,y,z )  is  a  function  of  three  variables,  all  the  calculations  proceed  in  essentially  the  same  way.  The 
rate  at  which  /  changes  in  a  particular  direction  is  V/-u,  where  now  V/=  (fx,fy,fz)  andu=  {u\,  112,1*3)  is 
a  unit  vector.  Again  V/  points  in  the  direction  of  maximum  rate  of  increase,  — V/  points  in  the  direction  of 
maximum  rate  of  decrease,  and  any  vector  perpendicular  to  V/  is  tangent  to  the  level  surface  f(x,y,z)  —  k 
at  the  point  in  question.  Of  course  there  are  no  longer  just  two  such  vectors;  the  vectors  perpendicular  to 
V/  describe  the  tangent  plane  to  the  level  surface,  or  in  other  words  V/  is  a  normal  to  the  tangent  plane. 


Example  13.20:  Gradient 


Suppose  the  temperature  at  a  point  in  space  is  given  by  T(x,y,z )  =  Tq/( I  A x2  +  y2  +  z1);  at  the 
origin  the  temperature  in  Kelvin  is  Tq  >  0,  and  it  decreases  in  every  direction  from  there.  It  might 
be,  for  example,  that  there  is  a  source  of  heat  at  the  origin,  and  as  we  get  farther  from  the  source, 
the  temperature  decreases.  The  gradient  is 

yy,  /  ~2TqX  ~2TqX  ~2TqX  . 

(1  +x2+y2  +  z2)2  (1  +x2+y2  +  z2)2  (1  +x2+y2  +  z2)2 

-27b  ,  , 

(l+x2+y2+z2)2[X’y’Z}- 

The  gradient  points  directly  at  the  origin  from  the  point  (x,y,z) — by  moving  directly  toward  the 
heat  source,  we  increase  the  temperature  as  quickly  as  possible. 


Example  13.21:  Tangent  Plane 


Find  the  points  on  the  surface  defined  by  x2  +  2 y2  +  3 z2  —  1  where  the  tangent  plane  is  parallel  to 
the  plane  defined  by3x  —  y  +  3z  —  1 . 


Solution.  Two  planes  are  parallel  if  their  normals  are  parallel  or  anti-parallel,  so  we  want  to  find  the  points 
on  the  surface  with  normal  parallel  or  anti-parallel  to  (3,  — 1,3).  Let  /  =  x2  +  2_v2  +  3z2',  the  gradient  of 
/  is  normal  to  the  level  surface  at  every  point,  so  we  are  looking  for  a  gradient  parallel  or  anti-parallel  to 
(3,  —  1 , 3) .  The  gradient  is  (2x,  4 y,  6 z) ;  if  it  is  parallel  or  anti-parallel  to  (3,  —  1 , 3) ,  then 

(2x,Ay,6z)  —k(  3, -1,3) 

for  some  k.  This  means  we  need  a  solution  to  the  equations 

2x  =  3k  Ay  —  —k  6  z  —  3k 

but  this  is  three  equations  in  four  unknowns — we  need  another  equation.  What  we  haven’t  used  so  far  is 
that  the  points  we  seek  are  on  the  surface  x2  +  2y2  +  3 z2  —  1 ;  this  is  the  fourth  equation.  If  we  solve  the 
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first  three  equations  for  x,  y,  and  z  and  substitute  into  the  fourth  equation  we  get 

=  (l  +  T6  +  l)kl 


,  ,  2V2  ,  .  (3V2  V2  V2\  f  3V2  V2  y/2' 

so  k  —  ±— — .  The  desired  points  are  |  — — -r7r,-r-  I  and 
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Exercises  for  13.5 


Exercise  13.5.1  Find  Duf  for  f  =  x2  +xy  -\-y2  in  the  direction  ofu  =  (2, 1)  at  the  point  (1, 1). 

Exercise  13.5.2  Find  Duf  for  f  —  sin(xy)  in  the  direction  ofu  —  (—1, 1)  at  the  point  (3, 1). 

Exercise  13.5.3  Find  Duf  for  f  =  ex  cos(y)  in  the  direction  30  degrees  from  the  positive  x  axis  at  the  point 
(1,tt/4). 

Exercise  13.5.4  The  temperature  of a  thin plate  in  the  x-y  plane  is  T  —  x  +y  .  How  fast  does  temperature 
change  at  the  point  (1,5)  moving  in  a  direction  30  degrees  from  the  positive  x  axis? 

Exercise  13.5.5  Suppose  the  density  of  a  thin  plate  at  (x,y)  is  1/ \J x2  +  y2  +  1 .  Find  the  rate  of  change  of 
the  density  at  (2, 1)  in  a  direction  n/3  radians  from  the  positive  x  axis. 

Exercise  13.5.6  Suppose  the  electric  potential  at  (x,y)  is  1  n  \Jx2  +y2.  Find  the  rate  of  change  of  the 
potential  at  (3,4)  toward  the  origin  and  cdso  in  a  direction  at  a  right  angle  to  the  direction  toward  the 
origin. 

Exercise  13.5.7  A  plane  perpendicular  to  the  x-y  plane  contains  the  point  (2,1,8)  on  the  paraboloid 
z  —  x2  +4y2.  The  cross-section  of  the  paraboloid  created  by  this  plane  has  slope  0  at  this  point.  Find  an 
equation  of  the  plane. 

Exercise  13.5.8  A  plane  perpendicular  to  the  x-y  plane  contains  the  point  (3,2,2)  on  the  paraboloid 
36 z  —  Ax2  +  9y2.  The  cross-section  of  the  paraboloid  created  by  this  plane  has  slope  0  at  this  point.  Find 
an  equation  of  the  plane. 

Exercise  13.5.9  Suppose  the  temperature  at  (x,y,z)  is  given  by  T  =xy  +  sin(yz).  In  what  direction  should 
you  go  from  the  point  (1,1,1)  to  decrease  the  temperature  as  quickly  as  possible?  What  is  the  rate  of 
change  of  temperature  in  this  direction? 
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Exercise  13.5.10  Suppose  the  temperature  at  (x,y,z)  is  given  by  T  —  xyz.  In  what  direction  can  you  go 
from  the  point  (1,1,1)  to  maintain  the  same  temperature? 

Exercise  13.5.11  Find  an  equation  for  the  plane  tangent  to  x  —  3  y1  +z2  =  7  at  (1,1,3). 

Exercise  13.5.12  Find  an  equation  for  the  plane  tangent  to  xyz  =  6  at  (1,2, 3). 

Exercise  13.5.13  Find  an  equation  for  the  line  normal  to  x2  +  2 y2  +  4  z2  —  26  at  (2,  -3,-1). 

Exercise  13.5.14  Find  an  equation  for  the  line  normal  to  x2  +  y2  +  9  z2  —  56  at  (4, 2,  —2). 

Exercise  13.5.15  Find  an  equation  for  the  line  normal  to  x2  +  5v2  —  z2  —  0  at  (4, 2, 6). 

Exercise  13.5.16  Find  the  directions  in  which  the  directional  derivative  of  f(x,y)  =  x2  +  sin(xy)  at  the 
point  (1,0)  has  the  value  1. 

Exercise  13.5.17  Show  that  the  curve  r(t)  —  (ln(/),/  ln(/),/)  is  tangent  to  the  surface  xz2  —yz  +  cos(xy)  =  1 
at  the  point  (0,0, 1). 

Exercise  13.5.18  A  bug  is  crawling  on  the  surface  of  a  hot  plate,  the  temperature  of  which  at  the  point  x 
units  to  the  right  of  the  lower  left  corner  and  y  units  up  from  the  lower  left  corner  is  given  by  T(x,y )  = 
100 -x2- 3  y3. 

(a)  If  the  bug  is  at  the  point  (2, 1),  in  what  direction  should  it  move  to  cool  off  the  fastest?  How  fast  will 
the  temperature  drop  in  this  direction? 

(b)  If  the  bug  is  at  the  point  (1,3),  in  what  direction  should  it  move  in  order  to  maintain  its  temperature? 

Exercise  13.5.19  The  elevation  on  a  portion  of  a  hill  is  given  by  f(x,y)  —  1 00  —  4x2  —  2 v.  From  the 
location  above  (2, 1),  in  which  direction  will  water  run? 

Exercise  13.5.20  Suppose  that  g(x,y)  =  y  —  x2.  Find  the  gradient  at  the  point  (  —  1,3).  Sketch  the  level 
cur\’e  to  the  graph  of  g  when  g(x,y)  —  2,  and  plot  both  the  tangent  line  and  the  gradient  vector  at  the  point 
(—1,3).  (Make  your  sketch  large).  What  do  you  notice,  geometrically? 

Exercise  13.5.21  The  gradient  V/  is  a  vector  valued  function  of  two  variables.  Prove  the  following 
gradient  rules.  Assume  f(x,y)  and  g(x,y)  are  differentiable  functions. 

(a)  V(/g)  =  /V(*)+*V(/) 

(b)  V(f/g)  =  ( gVf~fVg)/g 2 

(c)  V((/(x,y))")-«/(^D)"“1V/ 
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13.6  Higher  Order  Derivatives 


In  single  variable  calculus  we  saw  that  the  second  derivative  is  often  useful:  in  appropriate  circumstances 
it  measures  acceleration;  it  can  be  used  to  identify  maximum  and  minimum  points;  it  tells  us  something 
about  how  sharply  curved  a  graph  is.  Not  surprisingly,  second  derivatives  are  also  useful  in  the  multi- 
variable  case,  but  again  not  surprisingly,  things  are  a  bit  more  complicated. 

It’s  easy  to  see  where  some  complication  is  going  to  come  from:  with  two  variables  there  are  four 
possible  second  derivatives.  To  take  a  “derivative,”  we  must  take  a  partial  derivative  with  respect  to  x  or  y, 
and  there  are  four  ways  to  do  it:  x  then  x,  x  then  y,  y  then  x,  y  then  y. 


Example  13.22:  Second  Derivatives 


Compute  all  four  second  derivatives  of  f(x,y )  =  x2y2. 


Solution.  Using  an  obvious  notation,  we  get: 

fxx  =  2y  fxy  —  4xy  fyx  —  4xy  fyy  —  2x~. 


X 

You  will  have  noticed  that  two  of  these  are  the  same,  the  “mixed  partials”  computed  by  taking  partial 
derivatives  with  respect  to  both  variables  in  the  two  possible  orders.  This  is  not  an  accident — as  long  as 
the  function  is  reasonably  nice,  this  will  always  be  true. 


Theorem  13.23:  Clairaut’s  Theorem 


If  the  mixed  partial  derivatives  are  continuous,  they  are  equal. 


Example  13.24:  Mixed  Partials 


Compute  the  mixed  partials  of  f  —  xy/(x2  +y2). 


Solution.  The  mixed  partial  fxy  is  found  by  first  taking  the  partial  derivative  with  respect  to  x: 

_  y3-x2y 
h  (*2+y2)2’ 

then  with  respect  to  y: 

x4-6  x2y2+y4 

^  ( x2+y 2)3 


We  leave  fyx  as  an  exercise. 


X 
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Exercises  for  13.6 


Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 


13.6.1  Let  f  —  xy/ (x2  +y2);  compute  fxx,  fyx,  and  fyy. 

13.6.2  Find  all  first  and  second  partial  derivatives  ofx3y2+y5. 

13.6.3  Find  all  first  and  second  partied  derivatives  of  4a3  +xy2  +  10. 

13.6.4  Find  all  first  and  second  partial  derivatives  o/xsiny. 

13.6.5  Find  all  first  and  second  partial  derivatives  o/sin(3x)  cos(2y). 

13.6.6  Find  all  first  and  second  partial  derivatives  of  ex+y  . 

13.6.7  Find  all  first  and  second  partied  derivatives  of  In  y/x3  +  y4. 

13.6.8  Find  all  first  and  second  partial  derivatives  of  z  with  respect  to  x  and  y  if  x2  +  4 y2  + 


16z2  — 64  =  0. 


Exercise  13.6.9  Find  all  first  and  second  partial  derivatives  of  z  with  respect  to  x  and  y  ifxy+yz+xz  —  1. 

Exercise  13.6.10  Let  a  and  k  be  constants.  Prove  that  the  function  u(x,t)  —  e^ak2‘  sin  (Ax)  is  a  solution 
to  the  heat  equation  ut  —  cc2uxx 

Exercise  13.6.11  Let  a  be  a  constant.  Prove  that  u  —  sin(x  —  at  )  +  I  n  (x  +  at )  is  a  solution  to  the  wave 
equation  utt  —  a2uxx. 

Exercise  13.6.12  Flow  many  thirel-oreler  derivatives  does  a  function  of  2  variables  have?  How  many  of 
these  are  distinct? 

Exercise  13.6.13  How  many  nth  order  derivatives  does  a  function  of  2  variables  have?  How  many  of 
these  are  distinct? 


13.7  Maxima  and  Minima 


Suppose  a  surface  given  by  f(x,y)  has  a  local  maximum  at  (xo,yo+o);  geometrically,  this  point  on  the 
surface  looks  like  the  top  of  a  hill.  If  we  look  at  the  cross-section  in  the  plane  y  —  yo,  we  will  see  a  local 
maximum  on  the  curve  at  (xo,Zo)>  and  we  know  from  single- variable  calculus  that  |^  =  0  at  this  point. 

Likewise,  in  the  plane  x  =  xo,  =  0.  So  if  there  is  a  local  maximum  at  (xo,yo+o)>  both  partial  derivatives 
at  the  point  must  be  zero,  and  likewise  for  a  local  minimum.  Thus,  to  find  local  maximum  and  minimum 
points,  we  need  only  consider  those  points  at  which  both  partial  derivatives  are  0.  As  in  the  single-variable 
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case,  it  is  possible  for  the  derivatives  to  be  0  at  a  point  that  is  neither  a  maximum  or  a  minimum,  so  we 
need  to  test  these  points  further. 

You  will  recall  that  in  the  single  variable  case,  we  examined  three  methods  to  identify  maximum 
and  minimum  points;  the  most  useful  is  the  second  derivative  test,  though  it  does  not  always  work.  For 
functions  of  two  variables  there  is  also  a  second  derivative  test;  again  it  is  by  far  the  most  useful  test, 
though  it  doesn’t  always  work. 


Theorem  13.25:  Extrema  Test  for  Multivariate  Functions 


Suppose  that  the  second  partial  derivatives  of  f(x,y)  are  continuous  near  (xo,yo),  and  fx(xo,yo)  — 
fy(xo,yo)  =  0.  We  denote  by  D  the  discriminant  D(xo,yo)  =  fxx(xo,yo)fyy(xo,yo)  -  fxy{xo,yo)2 ■  If 
D  >  0  and  fxx(xo,yo)  <  0  there  is  a  local  maximum  at  (xo,yo);  ifD  >  0  and  fxx(x o,yo)  >  0  there  is  a 
local  minimum  at  (xoA’o) ;  if  D  <  0  there  is  neither  a  maximum  nor  a  minimum  at  (xo,yo);  if  D  =  0, 
the  test  fails. 


Solution.  First,  we  compute  all  the  needed  derivatives: 

fx  =  2x  fy  =  2  y  fxx  = 2  fyy  =  2  fxy  —  0. 

The  derivatives  fx  and  fy  are  zero  only  at  (0,0).  Applying  the  second  derivative  test  there: 

D(  0,0)  =  fxx{0,0)fyy(0,0)  —  fxy  (0,0)2  =  2-  2-  0  =  4>0, 
so  there  is  a  local  minimum  at  (0, 0) ,  and  there  are  no  other  possibilities. 


Solution.  The  derivatives: 

fx  =  2x  fy=  -2 y  fxx  =  2  fyy=  ~2  fxy  =  0. 

Again  there  is  a  single  critical  point,  at  (0,0),  and 

£>(0,0)  =  /xc(0, 0)^(0, 0)  -fxy( 0,0)2  =  2  •  -2-0  =  -4  <  0, 


so  there  is  neither  a  maximum  nor  minimum  there,  and  so  there  are  no  local  maxima  or  minima.  The 
surface  is  shown  in  Figure  13.9.  X 
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Figure  13.9:  A  saddle  point,  neither  a  maximum  nor  a  minimum. 


Solution.  The  derivatives: 

fx  =  4-r3  fy  =  4 y3  fxx  =  12x2  fyy  =  12 y2  fxy  =  0. 

Again  there  is  a  single  critical  point,  at  (0,0),  and 

£>(0,0)  =  fxxMfyyM  »/xy(0,0)2  =0  0^0=0, 

so  we  get  no  information.  However,  in  this  case  it  is  easy  to  see  that  there  is  a  minimum  at  (0,0),  because 
/(0,0)  =  0  and  at  all  other  points  f(x,y )  >0.  X 


Solution.  The  derivatives: 

fx  =  3-r2  fy  =  3  y2  fxx  =  6x2  fyy  =  6 y2  fxy  =  0. 

Again  there  is  a  single  critical  point,  at  (0,0),  and 

£>(0,0)  =  fxx(0,0)fyy(0,0)  -  /xy(0,0)2  =  0  •  0  -  0  =  0, 
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so  we  get  no  information.  In  this  case,  a  little  thought  shows  there  is  neither  a  maximum  nor  a  minimum 
at  (0,0):  when  x  and  y  are  both  positive,  f(x,y )  >  0,  and  when  x  and  y  are  both  negative,  f(x,y )  <  0,  and 
there  are  points  of  both  kinds  arbitrarily  close  to  (0,0).  Alternately,  if  we  look  at  the  cross-section  when 
y  =  0,  we  get  f(x,  0)  —  x3,  which  does  not  have  either  a  maximum  or  minimum  at  x  —  0.  X 


Example  13.30:  Optimizing  Dimensions  of  a  Box 


Suppose  a  box  with  no  top  is  to  hold  a  certain  volume  V.  Find  the  dimensions  for  the  box  that  result 
in  the  minimum  surface  area. 


Solution  The  area  of  the  box  is  A  —  2 hw  +  2 hi  +  Iw,  and  the  volume  is  V  =  Iwh,  so  we  can  write  the  area 
as  a  function  of  two  variables, 

2V  2V 

A(l,w )  =  — —  H - K  lw. 


I 


w 


Then 


„  2  V  J  „  2V  , 

Ai  —  ——. T  +  w  and  Aw  — - T  + 1. 

F  w~ 


If  we  set  these  equal  to  zero  and  solve,  we  find  w  =  (2V)1/3  and  l  =  (2V)1//3,  and  the  corresponding  height 
is  h  =  V/(2V)2/3. 

The  second  derivatives  are 


.  4V 

a«  =  7t 


_4V  _ 

A\vw  —  t  A/w  1, 
w5 


so  the  discriminant  is 


4V  4V 

D=  -= — =-■ 


l 3 


1=4— 1  =  3>0. 


vr’ 


Since  A//  is  2,  there  is  a  local  minimum  at  the  critical  point.  Is  this  a  global  minimum?  It  is,  but  it  is 
difficult  to  see  this  analytically;  physically  and  graphically  it  is  clear  that  there  is  a  minimum,  in  which 
case  it  must  be  at  the  single  critical  point.  X 


Recall  that  when  we  did  single  variable  global  maximum  and  minimum  problems,  the  easiest  cases 
were  those  for  which  the  variable  could  be  limited  to  a  finite  closed  interval,  for  then  we  simply  had  to 
check  all  critical  values  and  the  endpoints.  The  previous  example  is  difficult  because  there  is  no  finite 
boundary  to  the  domain  of  the  problem — both  w  and  /  can  be  in  (0,°°).  As  in  the  single  variable  case,  the 
problem  is  often  simpler  when  there  is  a  finite  boundary. 


Theorem  13.31:  Multivariate  Absolute  Extrema 


If  f(x,y)  is  continuous  on  a  closed  and  bounded  subset  of  Mr,  then  it  has  both  a  maximum  and 
minimum  value. 


As  in  the  case  of  single  variable  functions,  this  means  that  the  maximum  and  minimum  values  must 
occur  at  a  critical  point  or  on  the  boundary;  in  the  two  variable  case,  however,  the  boundary  is  a  curve,  not 
merely  two  endpoints. 
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Example  13.32:  Optimizing  Volume  of  a  Box 


The  length  of  the  diagonal  of  a  box  is  to  be  1  meter;  find  the  maximum  possible  volume. 


Solution.  If  the  box  is  placed  with  one  corner  at  the  origin,  and  sides  along  the  axes,  the  length  of  the 
diagonal  is  \/ x2  +y2  +  z2,  and  the  volume  is 

V  —  xyz  —  xy\/ 1  —x2  —  y2. 

Clearly,  x2  +y2  <  1,  so  the  domain  we  are  interested  in  is  the  quarter  of  the  unit  disk  in  the  first  quadrant. 
Computing  derivatives: 

_  y  —  2yx2  —  y3 
*  y/1  -x2-y2 

x  —  2xy2  —  x3 
y  \J  1  —  x2  —  y2 


If  these  are  both  0,  then  x  —  0  or  y  —  0,  or  x  —  y  —  1  /  \/3 .  The  boundary  of  the  domain  is  composed  of  three 
curves:  x  —  0  for  y  e  [0, 1] ;  y  =  0  for  x  G  [0, 1] ;  and  x2  +y2  =  1,  where  x  >  0  and  y  >  0.  In  all  three  cases, 
the  volume  xy^/l  —x2—y2  is  0,  so  the  maximum  occurs  at  the  only  critical  point  ( 1  / y/3, 1  / \/3, 1  / \/3). 
See  Figure  13.10.  £ 


Figure  13.10:  The  volume  of  a  box  with  fixed  length  diagonal. 
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Exercises  for  13.7 


Exercise  13.7.1  Find  all  local  maximum  and  minimum  points  of  f  —  x2  +  4 y2  —  2x  +  8_y  —  1. 

Exercise  13.7.2  Find  all  local  maximum  and  minimum  points  of  f  —  x2  —  y2  +  6x  —1  0  y  +  2. 

Exercise  13.7.3  Find  all  local  maximum  and  minimum  points  of  f  =  xy. 

Exercise  13.7.4  Find  all  local  maximum  and  minimum  points  of  f  —  9  +  4 x  —  y  —  lx2  —  3 y2. 

Exercise  13.7.5  Find  all  local  maximum  and  minimum  points  of  f  —  x2  +  4 xy  +  y2  —  6y  +  1. 

Exercise  13.7.6  Find  all  local  maximum  and  minimum  points  of  f  —  x2  —  xy  +  2y2  —  5x  +  6y  —  9. 

Exercise  13.7.7  Find  the  absolute  maximum  and  minimum  points  of  f  —  x2  +  3y  —  3xy  over  the  region 
bounded  by  y  —  x,  y  —  0,  and  x  =  2. 

Exercise  13.7.8  A  six-sided  rectangular  box  is  to  hold  1/2  cubic  meter;  what  shape  should  the  box  be  to 
minimize  surface  area? 

Exercise  13.7.9  The  post  office  will  accept  packages  whose  combined  length  and  girth  is  at  most  130 
inches.  (Girth  is  the  maximum  distance  around  the  package  perpendicular  to  the  length;  for  a  rectangular 
box,  the  length  is  the  largest  of  the  three  dimensions.)  What  is  the  largest  volume  that  can  be  sent  in  a 
rectangular  box? 

Exercise  13.7.10  The  bottom  of  a  rectangular  box  costs  twice  as  much  per  unit  area  as  the  sides  and  top. 
Find  the  shape  for  a  given  volume  that  will  minimize  cost. 

Exercise  13.7.11  Using  the  methods  of  this  section,  find  the  shortest  distance  from  the  origin  to  the  plane 
x+y  +  z  =  10. 

Exercise  13.7.12  Using  the  methods  of  this  section,  find  the  shortest  distance  from  the  point  (a'o.vo.zo)  to 
the  plane  ax  +  by  +  cz  —  d.  You  may  assume  that  c  f  0;  use  of  Sage  or  similar  software  is  recommended. 

Exercise  13.7.13  A  trough  is  to  be  formed  by  bending  up  two  sides  of  a  long  metal  rectangle  so  that  the 
cross-section  of  the  trough  is  an  isosceles  trapezoid.  If  the  width  of  the  metal  sheet  is  2  meters,  how  should 
it  be  bent  to  maximize  the  volume  of  the  trough? 

Exercise  13.7.14  Given  the  three  points  (1,4),  (5,2),  and  (3,  —2),  (x  —  l)2  +  (_y  —  4)2  +  (x  —  5)2  +  (y  — 
2)2  +  (x  —  3j2  f  (v  +  2)2  is  the  sum  of  the  squares  of  the  distances  from  point  (x, >’)  to  the  three  points. 
Find  x  and  y  so  that  this  quantity  is  minimized. 


Exercise  13.7.15  Suppose  that  f(x,y)  —  x2  +y2  +  kxy.  Find  and  classify  the  critical  points,  and  discuss 
how  they  change  when  k  takes  on  different  values. 
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Exercise  13.7.16  Find  the  shortest  distance  from  the  point  ((),/?)  to  the  parabola  y  —  x2. 

Exercise  13.7.17  Find  the  shortest  distance  from  the  point  (0,0,/?)  to  the  paraboloid  z  —  x2  +  y2. 
Exercise  13.7.18  Consider  the  function  f(x,y)  —  x3  —  3  x2y  +  yi. 

(a)  Show  that  (0,0)  is  the  only  critical  point  of  f. 

(b)  Show  that  the  discriminant  test  is  inconclusive  for  f. 

(c)  Determine  the  cross-sections  of  f  obtained  by  setting  y  —  kxfor  various  values  ofk. 

( d)  What  kind  of  critical  point  is  (0, 0)  ? 

Exercise  13.7.19  Find  the  volume  of  the  largest  rectangular  box  with  edges  parallel  to  the  axes  that  can 
be  inscribed  in  the  ellipsoid  lx2  +  12y2  +  1 8z2  =  288. 


13.8  Lagrange  Multipliers 


Many  applied  max/min  problems  take  the  following  form:  we  want  to  find  an  extreme  value  of  a  function, 
like  V  =  xyz,  subject  to  a  constraint,  like  1  =  \J x2  +y2  +  z1 .  Often  this  can  be  done,  as  we  have,  by 
explicitly  combining  the  equations  and  then  finding  critical  points.  There  is  another  approach  that  is  often 
convenient,  the  method  of  Lagrange  multipliers. 

It  is  somewhat  easier  to  understand  two  variable  problems,  so  we  begin  with  one  as  an  example. 
Suppose  the  perimeter  of  a  rectangle  is  to  be  100  units.  Find  the  rectangle  with  largest  area.  This  is  a 
fairly  straightforward  problem  from  single  variable  calculus.  We  write  down  the  two  equations:  A  =  xy, 
P  =  100  —  2x  +  2y,  solve  the  second  of  these  for  y  (or  x),  substitute  into  the  first,  and  end  up  with  a  one- 
variable  maximization  problem.  Let’s  now  think  of  it  differently:  the  equation  A  =  xy  defines  a  surface, 
and  the  equation  100  —  2x  +  2y  defines  a  curve  (a  line,  in  this  case)  in  the  x-y  plane.  If  we  graph  both  of 
these  in  the  three-dimensional  coordinate  system,  we  can  phrase  the  problem  like  this:  what  is  the  highest 
point  on  the  surface  above  the  line?  The  solution  we  already  understand  effectively  produces  the  equation 
of  the  cross-section  of  the  surface  above  the  line  and  then  treats  it  as  a  single  variable  problem.  Instead, 
imagine  that  we  draw  the  level  curves  (the  contour  lines)  for  the  surface  in  the  x-y  plane,  along  with  the 
line. 
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Figure  13.11:  Constraint  line  with  contour  plot  of  the  surface  xy. 

Imagine  that  the  line  represents  a  hiking  trail  and  the  contour  lines  are,  as  on  a  topographic  map,  the 
lines  of  constant  altitude.  How  could  you  estimate,  based  on  the  graph,  the  high  (or  low)  points  on  the 
path?  As  the  path  crosses  contour  lines,  you  know  the  path  must  be  increasing  or  decreasing  in  elevation. 
At  some  point  you  will  see  the  path  just  touch  a  contour  line  (tangent  to  it),  and  then  begin  to  cross  contours 
in  the  opposite  order — that  point  of  tangency  must  be  a  maximum  or  minimum  point.  If  we  can  identify 
all  such  points,  we  can  then  check  them  to  see  which  gives  the  maximum  and  which  the  minimum  value. 
As  usual,  we  also  need  to  check  boundary  points;  in  this  problem,  we  know  that  x  and  y  are  positive,  so 
we  are  interested  in  just  the  portion  of  the  line  in  the  first  quadrant,  as  shown.  The  endpoints  of  the  path, 
the  two  points  on  the  axes,  are  not  points  of  tangency,  but  they  are  the  two  places  that  the  function  xy  is  a 
minimum  in  the  first  quadrant. 

How  can  we  actually  make  use  of  this?  At  the  points  of  tangency  that  we  seek,  the  constraint  curve 
(in  this  case  the  line)  and  the  level  curve  have  the  same  slope — their  tangent  lines  are  parallel.  This  also 
means  that  the  constraint  curve  is  perpendicular  to  the  gradient  vector  of  the  function;  going  a  bit  further, 
if  we  can  express  the  constraint  curve  itself  as  a  level  curve,  then  we  seek  the  points  at  which  the  two  level 
curves  have  parallel  gradients.  The  curve  100  —  2x  +  2 y  can  be  thought  of  as  a  level  curve  of  the  function 
2x  +  2 y;  Figure  13.12  shows  both  sets  of  level  curves  on  a  single  graph.  We  are  interested  in  those  points 
where  two  level  curves  are  tangent — but  there  are  many  such  points,  in  fact  an  infinite  number,  as  we’ve 
only  shown  a  few  of  the  level  curves.  All  along  the  line  y  —  x  are  points  at  which  two  level  curves  are 
tangent.  While  this  might  seem  to  be  a  show-stopper,  it  is  not. 
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Figure  13.12:  Contour  plots  for  2x  +  2 y  and  xy. 

The  gradient  of  2x+2 y  is  (2, 2),  and  the  gradient  of  xy  is  (y,x) .  They  are  parallel  when  (2, 2)  —  X  (y,x), 
that  is,  when  2  —  Xy  and  2  =  Xx.  We  have  two  equations  in  three  unknowns,  which  typically  results  in 
many  solutions  (as  we  expected).  A  third  equation  will  reduce  the  number  of  solutions;  the  third  equation 
is  the  original  constraint,  100  =  2x+2y.  So  we  have  the  following  system  to  solve: 

2  —  Xy  2  =  Xx  100  =  2x  +  2 y. 

In  the  first  two  equations,  X  can’t  be  0,  so  we  may  divide  by  it  to  get  x  —  y  =  2/ X.  Substituting  into  the 
third  equation  we  get 


a2 

2—  +  2— 
A  X 

8 


100 


100 

X 


so  x  =  y  =  25.  Note  that  we  are  not  really  interested  in  the  value  of  X — it  is  a  clever  tool,  the  Lagrange 
multiplier,  introduced  to  solve  the  problem.  In  many  cases,  as  here,  it  is  easier  to  find  X  than  to  find 
everything  else  without  using  X . 

The  same  method  works  for  functions  of  three  variables,  except  of  course  everything  is  one  dimen¬ 
sion  higher:  the  function  to  be  optimized  is  a  function  of  three  variables  and  the  constraint  represents  a 
surface — for  example,  the  function  may  represent  temperature,  and  we  may  be  interested  in  the  maximum 
temperature  on  some  surface,  like  a  sphere.  The  points  we  seek  are  those  at  which  the  constraint  surface 
is  tangent  to  a  level  surface  of  the  function.  Once  again,  we  consider  the  constraint  surface  to  be  a  level 
surface  of  some  function,  and  we  look  for  points  at  which  the  two  gradients  are  parallel,  giving  us  three 
equations  in  four  unknowns.  The  constraint  provides  a  fourth  equation. 


Example  13.33:  Optimization  with  Constraints 


Maximize  the  function  xyz  given  the  constraint  1  =  \J x1  +  y 2  +  z2 . 
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Solution.  The  constraint  is  1  =  \J x2  -\-y2  +  z2,  which  is  the  same  as  1  =  x2  +  v2  +  z2.  The  function  to 
maximize  is  xyz.  The  two  gradient  vectors  are  ( 2x,2y,2z )  and  (yz,xz,xy),  so  the  equations  to  be  solved  are 

yz  —  2xA 
xz  —  2  yA 
xy  —  2zA 
1  =  x  +y  +z 


If  A  =  0  then  at  least  two  of  x,  y,  z.  must  be  0,  giving  a  volume  of  0,  which  will  not  be  the  maximum.  If  we 
multiply  the  first  two  equations  by  x  and  y  respectively,  we  get 

xyz  —  2x2A 
xyz  =  2y2A 


so  2x2A  =  2y2A  or  x2  =  y2 ;  in  the  same  way  we  can  show  x2  —  z2.  Hence  the  fourth  equation  becomes 
1  =  x2  +x2  +x2  or  x  =  1/  v/3,  and  so  x  =  y  =  z  =  1/  \/3  gives  the  maximum  volume.  This  is  of  course  the 
same  answer  we  obtained  previously.  X 

Another  possibility  is  that  we  have  a  function  of  three  variables,  and  we  want  to  find  a  maximum  or 
minimum  value  not  on  a  surface  but  on  a  curve;  often  the  curve  is  the  intersection  of  two  surfaces,  so  that 
we  really  have  two  constraint  equations,  say  g(x,y,z)  =  c i  and  h(x,y,z )  =  c'2-  It  turns  out  that  at  points  on 
the  intersection  of  the  surfaces  where  /  has  a  maximum  or  minimum  value, 

V/  =  AVg  +  /iVfi. 

As  before,  this  gives  us  three  equations,  one  for  each  component  of  the  vectors,  but  now  in  five  unknowns, 
x,  y,  z.  A,  and  /i.  Since  there  are  two  constraint  functions,  we  have  a  total  of  five  equations  in  five 
unknowns,  and  so  can  usually  find  the  solutions  we  need. 


Example  13.34:  Intersection  of  a  Plane  with  a  Cylinder 


The  planex+y  —  z—  1  intersects  the  cylinder x2  +  v2  =  1  in  an  ellipse.  Find  the  points  on  the  ellipse 
closest  to  and  farthest  from  the  origin. 


Solution.  We  want  the  extreme  values  of  /  =  y/x2  +  y2  +  z2  subject  to  the  constraints  g  =  x2  +  y2  =  1  and 
h  —  x  +  y  —  z—  1.  To  simplify  the  algebra,  we  may  use  instead  f  —  x2  +y2  +  z2,  since  this  has  a  maximum 
or  minimum  value  at  exactly  the  points  at  which  \J x2  +y2+z2  does.  The  gradients  are 

V/=  (2x,2y,2z)  Vg  =  (2x,2y,0)  Vh  =  ( 1,1, -1), 


so  the  equations  we  need  to  solve  are 


2x  =  A2x  +  /i 
2 y  —  A2  y  +  p 
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2z  =  0  -  p 
1  =  x2+y2 
1  =  x  +  y-z. 

Subtracting  the  first  two  we  get  2y  —  2x  =  A  (2 y  —  2x),  so  either  A  =  1  or  x  —  y.  If  A  =  1  then  p  —  0,  so 
z  —  0  and  the  last  two  equations  are 

l=x2+;y2  and  l=x  +  _y. 

Solving  these  gives  x=  1,  y  =  0,  or  x  =  0,  y  —  1,  so  the  points  of  interest  are  (1,0,0)  and  (0, 1,0),  which 
are  both  distance  1  from  the  origin.  If  x  =  y,  the  fourth  equation  is  2x2  =  1,  giving  x  =  y  =  ±1/ y/2,  and 
from  the  fifth  equation  we  get  z=  —  1  ±  y/2.  The  distance  from  the  origin  to  (1/ y/2, 1  / -s/2,  —  1  +  y/2)  is 
y/ 4  —  2y/~2  fa  1.08  and  the  distance  from  the  origin  to  (—1/ y/2,  —  \/ y/2,—1  —  y/2)  is  \J 4  +  2\/2  fa  2.6. 
Thus,  the  points  (1,0,0)  and  (0,1,0)  are  closest  to  the  origin  and  ( —  1  / y/2,  —  1  / y/2,  —  1  —  y/2)  is  farthest 
from  the  origin.  £ 


Exercises  for  13.8 


Exercise  13.8.1  A  six-sided  rectangular  box  is  to  hold  1/2  cubic  meter;  what  shape  should  the  box  be  to 
minimize  surface  area? 

Exercise  13.8.2  The  post  office  will  accept  packages  whose  combined  length  and  girth  are  at  most  130 
inches  (girth  is  the  maximum  distance  around  the  package  perpendicular  to  the  length ).  What  is  the  largest 
volume  that  can  be  sent  in  a  rectangular  box? 

Exercise  13.8.3  The  bottom  of  a  rectangular  box  costs  twice  as  much  per  unit  area  as  the  sides  and  top. 
Find  the  shape  for  a  given  volume  that  will  minimize  cost. 

Exercise  13.8.4  Using  Lagrange  multipliers,  find  the  shortest  distance  from  the  point  (xo,vo,zo)  to  the 
plane  ax  +  by  +  cz  —  d. 

Exercise  13.8.5  Find  all  points  on  the  surface  xy  —  z2  +  1  =  0  that  are  closest  to  the  origin. 

Exercise  13.8.6  The  material  for  the  bottom  of  an  aquarium  costs  half  as  much  as  the  high  strength  glass 
for  the  four  sides.  Find  the  shape  of  the  cheapest  aquarium  that  holds  a  given  volume  V. 

Exercise  13.8.7  The  plane  x  —  y  +  z  —  2  intersects  the  cylinder  x2  +y2  —  4  in  an  ellipse.  Find  the  points 
on  the  ellipse  closest  to  and  farthest  from  the  origin. 


Exercise  13.8.8  Find  three  positive  numbers  whose  sum  is  48  and  whose  product  is  as  large  as  possible. 
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Exercise  13.8.9  Find  all  points  on  the  plane  x  +  y  +  z  —  5  in  the  first  octant  at  which  f(x,y,z )  —  xy2z2  has 
a  maximum  value. 


Exercise  13.8.10  Find  the  points  on  the  surface  x2  —  yz  =  5  that  are  closest  to  the  origin. 

Exercise  13.8.11  A  manufacturer  makes  two  models  of  an  item,  standard  and  deluxe.  It  costs  $40  to 
manufacture  the  standard  model  and  $60  for  the  deluxe.  A  market  research  firm  estimates  that  if  the 
standard  model  is  priced  at  x  dollars  and  the  deluxe  at  y  dollars,  then  the  manufacturer  will  sell  500(_y  —  x) 
of  the  standard  items  and  45,000  +  500(.r  —  2 y)  of  the  deluxe  each  year.  How  should  the  items  be  priced 
to  maximize  profit? 


Exercise  13.8.12  A  length  of  sheet  metal  is  to  be  made  into  a  water  trough  by  bending  up  two  sides  as 
shown  in  Figure  13.13.  Find  x  and  (j)  so  that  the  trapezoid-shaped  cross  section  has  maximum  area,  when 
the  width  of  the  metal  sheet  is  27  inches  (that  is,  2x  +  y  —  21). 


y 


Figure  13.13:  Cross-section  of  a  trough. 


Exercise  13.8.13  Find  the  maximum  and  minimum  values  of  f(x,y,z)  —  6x+3y  +  2z  subject  to  the  con¬ 
straint  g(x,y,  z)  —  4x2  +  2 y2  +  z2  —  70  —  0. 

Exercise  13.8.14  Find  the  maximum  and  minimum  values  off(x,y)  —  exy  subject  to  the  constraint  g(x,y)  — 
x3+y3-  16  =  0. 

Exercise  13.8.15  Find  the  maximum  and  minimum  values  off(x,y)  —  xy  +  \/ 9  —  x2  —  y2  when  x2  +y2  < 
9. 

Exercise  13.8.16  Find  three  real  numbers  whose  sum  is  9  and  the  sum  of  whose  squares  is  a  small  as 
possible. 

Exercise  13.8.17  Find  the  dimensions  of  the  closed  rectangular  box  with  maximum  volume  that  can  be 
inscribed  in  the  unit  sphere. 
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14.1  Volume  and  Average  Height 


Consider  a  surface  f(x,y);  you  might  temporarily  think  of  this  as  representing  physical  topography — a 
hilly  landscape,  perhaps.  What  is  the  average  height  of  the  surface  (or  average  altitude  of  the  landscape) 
over  some  region? 

As  with  most  such  problems,  we  start  by  thinking  about  how  we  might  approximate  the  answer.  Sup¬ 
pose  the  region  is  a  rectangle,  [a,b\  x  [c,d].  We  can  divide  the  rectangle  into  a  grid,  m  subdivisions  in 
one  direction  and  n  in  the  other,  as  indicated  in  Figure  14.1.  We  pick  x  values  xo,  x\,. . x„,_i  in  each 
subdivision  in  the  x  direction,  and  similarly  in  the  y  direction.  At  each  of  the  points  ( Xj,yj )  in  one  of  the 
smaller  rectangles  in  the  grid,  we  compute  the  height  of  the  surface:  f{xuyj).  Now  the  average  of  these 
heights  should  be  (depending  on  the  fineness  of  the  grid)  close  to  the  average  height  of  the  surface: 

/OoAo )  +/(-Wyo)H - h/(x0,yi)  +/(xi,yi)H - l-/(*,»-i,y>i-i) 

mil 

As  both  m  and  n  go  to  infinity,  we  expect  this  approximation  to  converge  to  a  fixed  value,  the  actual 
average  height  of  the  surface.  For  reasonably  nice  functions  this  does  indeed  happen. 


d- 

k3  - 
k2  - 

yi  - 

c  - 


Ax 


Ay 


l  i  i  i  i  i  i 

a  x\  X2  X'i  X4  X5  b 

Figure  14.1:  A  rectangular  subdivision  of  [a,b\  x  [c,d\. 


Using  sigma  notation,  we  can  rewrite  the  approximation: 


1 


n—  1  /7Z —  1 


—  E  E  /(w) 


i 


n—  1  in—  1 


nm  ■ 


i= 0  j= 0 


n  \(j  aEE/M 

(b-a)(d-c)  (tojr0 
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The  two  parts  of  this  product  have  useful  meaning:  (b  —  a)(d  —  c )  is  of  course  the  area  of  the  rectangle, 
and  the  double  sum  adds  up  mn  terms  of  the  form  f(xj,yi)AxAy,  which  is  the  height  of  the  surface  at  a 
point  multiplied  by  the  area  of  one  of  the  small  rectangles  into  which  we  have  divided  the  large  rectangle. 
In  short,  each  term  f(xj,yi)AxAy  is  the  volume  of  a  tall,  thin,  rectangular  box,  and  is  approximately  the 
volume  under  the  surface  and  above  one  of  the  small  rectangles;  see  Figure  14.2.  When  we  add  all  of  these 
up,  we  get  an  approximation  to  the  volume  under  the  surface  and  above  the  rectangle  R  —  [a,b]  x  [c,d]. 
When  we  take  the  limit  as  m  and  n  go  to  infinity,  the  double  sum  becomes  the  actual  volume  under  the 
surface,  which  we  divide  by  (b  —  a)(d  —  c )  to  get  the  average  height. 


Figure  14.2:  Approximating  the  volume  under  a  surface. 


Double  sums  like  this  come  up  in  many  applications,  so  in  a  way  it  is  the  most  important  part  of  this 
example;  dividing  by  (b  —  a)(d  —  c )  is  a  simple  extra  step  that  allows  the  computation  of  an  average.  As 
we  did  in  the  single  variable  case,  we  introduce  a  special  notation  for  the  limit  of  such  a  double  sum: 


n —  1  in —  1 


11 —  1  III —  1  no  no 

lim  E  V  f(xj,yi)AxAy  =  //  f(x,y)dxdy=  //  f(x,y)dA, 
ito  ./to  JJr 


the  double  integral  of  /  over  the  region  R.  The  notation  dA  indicates  a  small  bit  of  area,  without  specifying 
any  particular  order  for  the  variables  x  and  y;  it  is  shorter  and  more  “generic”  than  writing  dxdy.  The 
average  height  of  the  surface  in  this  notation  is 

(b-a)(d-c)ILf(x'y)dA- 


The  next  question,  of  course,  is:  How  do  we  compute  these  double  integrals?  You  might  think  that  we 
will  need  some  two-dimensional  version  of  the  Fundamental  Theorem  of  Calculus,  but  as  it  turns  out  we 
can  get  away  with  just  the  single  variable  version,  applied  twice. 

Going  back  to  the  double  sum,  we  can  rewrite  it  to  emphasize  a  particular  order  in  which  we  want  to 
add  the  terms: 
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In  the  sum  in  parentheses,  only  the  value  of  xj  is  changing;  y,  is  temporarily  constant.  As  m  goes  to  inhnity, 
this  sum  has  the  right  form  to  turn  into  an  integral: 


m—  1  ,-b 

lim  Y  f(xj,yi)Ax=  /  f(x,yi)dx. 
m^°°  “0  Ja 


So  after  we  take  the  limit  as  m  goes  to  inhnity,  the  sum  is 


f(x,yi)  dx  Ay. 


n—  1 

E 

i= 0 

Of  course,  for  different  values  of  y,  this  integral  has  different  values;  in  other  words,  it  is  really  a  function 
applied  to  y,: 


G(y)=/  f{x,y)dx. 


If  we  substitute  back  into  the  sum  we  get 


E  G(^')A.v- 

i= 0 


This  sum  has  a  nice  interpretation.  The  value  G(y,-)  is  the  area  of  a  cross  section  of  the  region  under  the 
surface  f(x,y),  namely,  when  y  =  y/.  The  quantity  G(y,-)Ay  can  be  interpreted  as  the  volume  of  a  solid 
with  face  area  G(y;)  and  thickness  Ay.  Think  of  the  surface  f(x,y )  as  the  top  of  a  loaf  of  sliced  bread. 
Each  slice  has  a  cross-sectional  area  and  a  thickness;  G(y,)Ay  corresponds  to  the  volume  of  a  single  slice 
of  bread.  Adding  these  up  approximates  the  total  volume  of  the  loaf.  (This  is  very  similar  to  the  technique 
we  used  to  compute  volumes  in  Section  8.3,  except  that  there  we  need  the  cross-sections  to  be  in  some 
way  “the  same”.)  Figure  14.3  shows  this  “sliced  loaf”  approximation  using  the  same  surface  as  shown  in 
Figure  14.2.  Nicely  enough,  this  sum  looks  just  like  the  sort  of  sum  that  turns  into  an  integral,  namely, 

n —  1 


lim  £  G(y,-)Ay  =  I  G(y)  dy 

"“►“.to  Jc 


rd  rb 


f(x,y )  dxdy. 


Fet’s  be  clear  about  what  this  means:  we  hrst  will  compute  the  inner  integral,  temporarily  treating  y  as  a 
constant.  We  will  do  this  by  finding  an  anti-derivative  with  respect  to  x,  then  substituting  x  —  a  and  x  —  b 
and  subtracting,  as  usual.  The  result  will  be  an  expression  with  no  x  variable  but  some  occurrences  of  y. 
Then  the  outer  integral  will  be  an  ordinary  one- variable  problem,  with  y  as  the  variable. 
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Figure  14.3:  Approximating  the  volume  under  a  surface  with  slices. 


Solution.  The  volume  under  this  surface  is 


The  inner  integral  is 


sin(xy)  +  -  dx  — 


f 2-5  r 3.5  5 
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Unfortunately,  this  gives  a  function  for  which  we  can’t  hnd  a  simple  anti-derivative.  To  complete  the 
problem  we  could  use  Sage  or  similar  software  to  approximate  the  integral.  Doing  this  gives  a  volume  of 
approximately  8.84,  so  the  average  height  is  approximately  8.84/6  w  1.47.  X 


Because  addition  and  multiplication  are  commutative  and  associative,  we  can  rewrite  the  original  dou¬ 
ble  sum: 

n—lm—l  m— In— 1 

L  H  /(*;,yi)AxAy  =  £  £  f(xj,yi)AyAx. 
i= 0  y=0  7=0  ;'=0 

Now  if  we  repeat  the  development  above,  the  inner  sum  turns  into  an  integral: 

n—  1 


i=0 

and  then  the  outer  sum  turns  into  an  integral: 

'”-1  /  rd 

Y  I  l 

m— >oo 


n— 1  /*<3 

lim  E  f(xj,yi)Ay  =  /  f(xj,y)dy, 
1-^°°  Jc 


1  /  ra  \  rb  rd 

lim  V  /  f(xj,y)dy  )  Ax  =  /  /  f(x,y)dydx. 
n^°°  ;“o  \./c  /  7a  7c 


14.1.  Volume  and  Average  Height  497 


In  other  words,  we  can  compute  the  integrals  in  either  order,  first  with  respect  to  x  then  y,  or  vice  versa. 
Thinking  of  the  loaf  of  bread,  this  corresponds  to  slicing  the  loaf  in  a  direction  perpendicular  to  the  first. 

We  haven’t  really  proved  that  the  value  of  a  double  integral  is  equal  to  the  value  of  the  corresponding 
two  single  integrals  in  either  order  of  integration,  but  provided  the  function  is  reasonably  nice,  this  is  true; 
the  result  is  called  Fubini’s  Theorem. 


Example  14.2:  Compute  Volume  in  Two  Ways 

We  compute  J 

j  1  +  (x—  l)2  +  4y2dA,  whereR—  [0,3]  x  [0,2 ],  in  two  ways. 

Solution.  First, 


[  [  l  +  (x-l)2  +  4 y2dydx=  [  y+  (x-  l)2y  +  \y 
J 0  Jo  Jo  3 

r3  30 

=  /  2  +  2(x—  l)2  +  —  dx 
Jo  3 


„  2  in3  32 

=  2x+-(x- l)3  +  yX 

2  32  2 

=  6+3-8  +  V-3-(0-1-3+0) 

=  44. 


In  the  other  order: 


n3  /-2  ( x 1^3 

1  +  (x  —  l)2  +  4v2 dx dy  —  /  x+- — - |-4v2x 

Jo  3 


dy 


o 


f2  8  ,  1 

=  1  3  +  3  +  Uy  +3dy 


8 


1 


=  3y  +  -y  +  4/  +  -y 

,  16  ^  2 
=  6  +  y  +  32+- 

=  44. 


X 

In  this  example  there  is  no  particular  reason  to  favor  one  direction  over  the  other;  in  some  cases, 
one  direction  might  be  much  easier  than  the  other,  so  it’s  usually  worth  considering  the  two  different 
possibilities. 

Frequently  we  will  be  interested  in  a  region  that  is  not  simply  a  rectangle.  Let’s  compute  the  volume 
under  the  surface  x  +  2 y2  above  the  region  described  by  0  <  x  <  1  and  0  <y<*2,  shown  in  Figure  14.4. 
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1 


0  J L 

0  1 


Figure  14.4:  A  parabolic  region  of  integration. 


In  principle  there  is  nothing  more  difficult  about  this  problem.  If  we  imagine  the  three-dimensional 
region  under  the  surface  and  above  the  parabolic  region  as  an  oddly  shaped  loaf  of  bread,  we  can  still 
slice  it  up,  approximate  the  volume  of  each  slice,  and  add  these  volumes  up.  For  example,  if  we  slice 
perpendicular  to  the  x  axis  at  x;,  the  thickness  of  a  slice  will  be  Ax  and  the  area  of  the  slice  will  be 


/  Xj  +  2 y2  dy. 

Jo 

When  we  add  these  up  and  take  the  limit  as  Ax  goes  to  0,  we  get  the  double  integral 

f  *  2 

x  +  2y2  dydx  =  /  xy+-v3 
Jo  3 


dx 


o 

=  f  x3  4-  -x6  dx 


10 


l 


-  4+2F  0 
1  2  _  29 

“  4  +  21  “  84' 

We  could  just  as  well  slice  the  solid  perpendicular  to  the  y  axis,  in  which  case  we  get 


[  [  x  +  2 y1  dxdy  —  [  — +  2y2x 
Jo  J  yjy  Jo  2 


dy 


\ /y 


’!  1 


=  ./o  2+2y  -2~2y^dy 

=  l  +  2y-yL--yVi' 

2  3"  4  T  0 

12  1  4  29 

“2  +  3-4~7_84' 


What  is  the  average  height  of  the  surface  over  this  region?  As  before,  it  is  the  volume  divided  by  the  area 
of  the  base,  but  now  we  need  to  use  integration  to  compute  the  area  of  the  base,  since  it  is  not  a  simple 
rectangle.  The  area  is 


x1  dx 


1 

3’ 


so  the  average  height  is  29/28. 
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Example  14.3:  Volume  of  Region 


Find  the  volume  under  the  surface  z  —  a/  1  —  x2  and  above  the  triangle  formed  by  y  —  x,  x  —  1,  and 
the  x-axis. 


Solution.  Let’s  consider  the  two  possible  ways  to  set  this  up: 


1  px 


0  JO 


\J  1  —x2dydx 


1  /•! 


or 


0  Jy 


a/  1  —x2dxdy. 


Which  appears  easier?  In  the  first,  the  first  (inner)  integral  is  easy,  because  we  need  an  anti-derivative  with 
respect  to  y,  and  the  entire  integrand  \J  1  —  x2  is  constant  with  respect  to  y.  Of  course,  the  second  integral 
may  be  more  difficult.  In  the  second,  the  first  integral  is  mildly  unpleasant — a  trig  substitution.  So  let’s 
try  the  first  one,  since  the  first  step  is  easy,  and  see  where  that  leaves  us. 


—  x2dydx 


dx  — 


—  x2  dx. 


This  is  quite  easy,  since  the  substitution  u—l—x2  works: 


j'  X\/ 1  —  A'2  dx  =  —  -  fVudu  —  =  —  2  —  X2)2/2. 

Then 

[  x\J  1  —  x- dx  —  —  \ ( 1  —  x2)3/2  =\. 

Jo  3  0  3 

This  is  a  good  example  of  how  the  order  of  integration  can  affect  the  complexity  of  the  problem.  In  this 
case  it  is  possible  to  do  the  other  order,  but  it  is  a  bit  messier.  In  some  cases  one  order  may  lead  to  a  very 
difficult  or  impossible  integral;  it’s  usually  worth  considering  both  possibilities  before  going  very  far.  X 


Exercises  for  14.1 


Exercise  14.1.1 


Exercise  14.1.2 


Exercise  14.1.3 


Exercise  14.1.4 


Compute 


Compute 


Compute 


Compute 


1  +xdydx. 


l-i  Jo 


x  +  ydydx. 
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Exercise  14.1.5  Compute 


Exercise  14.1.6  Compute 


Exercise  14.1.7  Compute 


Exercise  14.1.8  Compute 


drdG. 


Exercise  14.1.9  Compute: 


Exercise  14.1.10  Compute: 


Exercise  14.1.11  Compute: 


Exercise  14.1.12  Compute: 


Exercise  14.1.13  Compute: 


Exercise  14.1.14  Compute 


Exercise  14.1.15  Compute 


10  J^y 

f'f' 

Jo  Jy2 

f'f' 

Jo  Jx2 

/•>’  2 


V-f3  +  Idxdy. 
y  si  n  (.r2  j  dxdy. 
x\J  \  +y2dydx 
dxdy 


Jo  Jo  y/l  -X2 

"1  r3  2 

ex  dxdy 


0  J3y 


’  1  r  1  —xr 


-1  JO 

72/2  rVl-2x2 

0  J~Vl~2x2 


x2  —  yfydydx. 


xdydx. 


Exercise  14.1.16  Evaluate  jj  x2  dA  over  the  region  in  the  first  quadrant  bounded  by  the  hyperbola  xy  = 
16  and  the  lines  y  —  x,  y  —  0,  and  x  —  8. 

Exercise  14.1.17  Find  the  volume  below  z  —  1  —  y  above  the  region  —1  <  x  <  1,  0  <  y  <  1  —  x2. 

Exercise  14.1.18  Find  the  volume  bounded  by  z  —  x2  +y2  and  z  —  4. 

Exercise  14.1.19  Find  the  volume  in  the  first  octant  bounded  by  y2  =  A  —  x  and  y  =  2 z. 

Exercise  14.1.20  Find  the  volume  in  the  first  octant  bounded  by  y2  —  Ax,  2x  +  y  =  A,  z  —  y,  and  y  =  0. 


Exercise  14.1.21  Find  the  volume  in  the  first  octant  bounded  byx+y+z  =  9,  2x+3y  —  18,  and  x + 3y  =  9. 
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Exercise  14.1.22 
Exercise  14.1.23 
Exercise  14.1.24 

Exercise  14.1.25 

(4, 1,0),  (1,2,0). 


Find  the  volume  in  the  first  octant  bounded  by  x1  +y2  —  a2  and  z  =  x  +  y. 

Find  the  volume  bounded  by  4.x2  +  y2  =  4 z  and  z  —  2. 

Find  the  volume  bounded  by  z  —  x2  -\-y2  and  z  —  y. 

Find  the  volume  under  the  surface  z—xy  above  the  triangle  with  vertices  (1,1,0), 


Exercise  14.1.26  Find  the  volume  enclosed  by  y  —  x2,  y  —  4,  z  —  x2,  z  —  0. 

Exercise  14.1.27  A  swimming  pool  is  circular  with  a  40  meter  diameter.  The  depth  is  constant  along 
east-west  lines  and  increases  linearly  from  2  meters  at  the  south  end  to  7  meters  at  the  north  end.  Find  the 
volume  of  the  pool. 

Exercise  14.1.28  Find  the  average  value  of  f  (x, y)  —  ey  ^/x-\-ey  on  the  rectangle  with  vertices  (0, 0),  (4,0), 
(4, 1)  and  (0, 1). 

Exercise  14.1.29  Figure  14.5  shows  a  temperature  map  of  Colorado.  Use  the  data  to  estimate  the  average 
temperature  in  the  state  using  4,  16  and  25  subdivisions.  Give  both  an  upper  and  lower  estimate.  Why  do 
we  like  Colorado  for  this  problem?  What  other  state(s)  might  we  like? 


Figure  14.5:  Colorado  temperatures. 


Exercise  14.1.30  Three  cylinders  of  radius  1  intersect  at  right  angles  at  the  origin,  as  shown  in  Fig¬ 
ure  14.6.  Find  the  volume  contained  inside  all  three  cylinders. 
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Figure  14.6:  Intersection  of  three  cylinders. 


Exercise  14.1.31  Prove  that  if  f{x,y)  is  integrable  and  if  g(x,y)  =  /  /  f(s,t)  dt  ds  then  gxy  —  gyx 

J  a  Jb 

f(x,y). 

Exercise  14.1.32  Reverse  the  order  of  integration  on  each  of  the  following  in  tegrals 
/'9  rV^—y 

(a)  /  f(x,y)dxdy 

Jo  Jo 

nlnx 

f(x,y)  dydx 

rl  rJt/2 

(c)  /  f(x,y)  dx  dy 

JO  Jarcsin  v 

(d)  [  [  f(x,y)  dy  dx 

JO  J 4x 


/*3  ry/9-y2 

(e)  /  f{x,y)  dx  dy 

Jo  Jo 


Exercise  14.1.33  What  are  the  parallels  between  Fubini’s  Theorem  and  Clairaut’s  Theorem? 


14.2  Double  Integrals  in  Polar  Coordinates 


Suppose  we  have  a  surface  given  in  polar  coordinates  as  z  —  f(r,  0)  and  we  wish  to  find  the  integral  over 
some  region.  We  could  attempt  to  translate  into  rectangular  coordinates  and  do  the  integration  there,  but  it 
is  often  easier  to  stay  in  polar  coordinates. 

How  might  we  approximate  the  volume  under  such  a  surface  in  a  way  that  uses  polar  coordinates 
directly?  The  basic  idea  is  the  same  as  before:  we  divide  the  region  into  many  small  regions,  multiply 
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the  area  of  each  small  region  by  the  height  of  the  surface  somewhere  in  that  little  region,  and  add  them 
up.  What  changes  is  the  shape  of  the  small  regions;  in  order  to  have  a  nice  representation  in  terms  of  r 
and  0,  we  use  small  pieces  of  ring-shaped  areas,  as  shown  in  Figure  14.7.  Each  small  region  is  roughly 
rectangular,  except  that  two  sides  are  segments  of  a  circle  and  the  other  two  sides  are  not  quite  parallel. 
Near  a  point  (r,  0),  the  length  of  either  circular  arc  is  about  rAO  and  the  length  of  each  straight  side  is 
simply  A r.  When  Ar  and  A0  are  very  small,  the  region  is  nearly  a  rectangle  with  area  rArAO,  and  the 
volume  under  the  surface  is  approximately 


££/(>*>  0/>*ArA0. 


In  the  limit,  this  turns  into  a  double  integral 


rn 


f(r,  0)rdrd0. 


"'0 


Example  14.4:  Volume  of  One-Eighth  of  a  Sphere 


Find  the  volume  under  z  —  \J A  — r2  above  the  quarter  circle  bounded  by  the  two  axes  and  the  circle 
x2  +  y2  =  4  in  the  first  quadrant. 


Solution.  In  terms  of  r  and  0,  this  region  is  described  by  the  restrictions  0  <  r  <  2  and  0  <  0  <  7t/2,  so 
we  have 


—  r 2 rdrdO 


-1(4 


2 


do 

0 


Ak 


3  ' 
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The  surface  is  a  portion  of  the  sphere  of  radius  2  centered  at  the  origin,  in  fact  exactly  one-eighth  of  the 
sphere.  We  know  the  formula  for  volume  of  a  sphere  is  (4/3)  Tfr3,  so  the  volume  we  have  computed  is 
(1/8)  (4/3) tt23  =  (4/3)7T,  in  agreement  with  our  answer.  X 

This  example  is  much  like  a  simple  one  in  rectangular  coordinates:  the  region  of  interest  may  be 
described  exactly  by  a  constant  range  for  each  of  the  variables.  As  with  rectangular  coordinates,  we  can 
adapt  the  method  to  deal  with  more  complicated  regions. 


Solution.  The  region  is  described  in  polar  coordinates  by  the  inequalities  —% /2  <  0  <  71  /2  and  0  <  r  < 
2  cos  0,  so  the  double  integral  is 


—  r 2  rdrdO. 


We  can  rewrite  the  integral  as  shown  because  of  the  symmetry  of  the  volume;  this  avoids  a  complication 
during  the  evaluation.  Proceeding: 


fit  1 2  /-2COS0  . -  rn/z  i 

2/  /  \J  A  —  r1  r  dr  dQ  —2  /  ~  (4  -r2)3/2 

Jo  Jo  Jo  3 

,7T/2  g 


^/2  1 


2  cos  0 


de 


=  2 


=  2 


o  3 
8  cos3  0 


-  sin  0  +  -d6 


3  3 


COS0  +  2^ 


8  32 


7T/2 

0 


Figure  14.8:  Volume  over  a  region  with  non-constant  limits. 
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You  might  have  learned  a  formula  for  computing  areas  in  polar  coordinates.  It  is  possible  to  compute 
areas  as  volumes,  so  that  you  need  only  remember  one  technique.  Consider  the  surface  z—  1,  a  horizontal 
plane.  The  volume  under  this  surface  and  above  a  region  in  the  x-y  plane  is  simply  1  •  (area  of  the  region), 
so  computing  the  volume  really  just  computes  the  area  of  the  region. 


Solution  The  region  is  described  by  7r/6  <  0  <5n/6  and  2  <  r  <  4sin0,  so  the  integral  is 


r5n/6  r  4sin0 
In/ 6  J2 


1 rdrdO  = 


r5*/6  l  , 

L/6  2r 

r5n/6 


4sin0 


d0 


ln/6 


8  sin  6  —  2  d6 


—  —K  +  2\/3. 


Figure  14.9:  Finding  area  by  computing  volume. 


X 


Exercises  for  14.2 


Exercise  14.2.1 
Exercise  14.2.2 
Exercise  14.2.3 
Exercise  14.2.4 
Exercise  14.2.5 


Find  die  volume  above  the  x-y  plane,  under  the  surface  r2  —  2 z,  and  inside  r 
Find  the  volume  inside  both  r  —  1  and  r2  +  z2  —  4. 

Find  the  volume  below  z,  —  \J  1  —  r2  and  above  the  top  half  of  the  cone  z  —  r. 
Find  the  volume  below  z  —  r,  above  the  x-y  plane,  and  inside  r  =  cos  0. 

Find  the  volume  below  z  —  r,  above  the  x-y  plane,  and  inside  r  —  1  +  cos  0. 


2. 
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Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 

Exercise 


14.2.6  Find  the  volume  between  x1  +  v2  =  z2  and  x2  +y2  —  z. 

14.2.7  Find  the  area  inside  r  —  1  +  sin  0  and  outside  r  —  2  sin  0. 

14.2.8  Find  the  area  inside  both  r  —  2  sin  0  and  r  —  2  cos  0. 

14.2.9  Find  the  area  inside  the  four-leaf  rose  r  =  cos(2  0)  and  outside  r  —  1/2. 

14.2.10  Find  the  area  inside  the  cardioid  r  =  2(1  +  cos  0)  and  outside  r  =  2. 

14.2.11  Find  the  area  of  one  loop  of  the  three-leaf  rose  r  =  cos(30). 

r3  ry/9—x1 

14.2.12  Compute  /  /  sin  (x  -\-y  )  dydx  by  converting  to  polar  coordinates. 

J- 3  Jo 

fa  r°  2 

14.2.13  Compute  /  /  _ x  y dydx  by  converting  to  polar  coordinates. 

Jo  J —fa2— x2 

14.2.14  Find  the  volume  under  z  =  y2  +  x  +  2  above  the  region  x2  +  y2  <  4 

14.2.15  Find  the  volume  between  z  —  x2yi  and  z—  1  above  the  region  x2  +  y2  <  1 

14.2.16  Find  the  volume  inside  x2  +y2  —  1  and  x2  +z2  —  1. 

14.2.17  Find  the  volume  under  z  —  r  above  r  —  3  +  cos  0. 

14.2.18  Figure  14.10  shows  the  plot  of  r—  1  +4sin(50). 


(a)  Describe  the  behavior  of  the  graph  in  terms  of  the  given  equation.  Specifically,  explain  maximum 
and  minimum  values,  number  of  leaves,  and  the  ’leaves  within  leaves’. 

(b)  Give  an  integral  or  integrals  to  determine  the  area  outside  a  smaller  leaf  but  inside  a  larger  leaf. 

(c)  How  would  changing  the  value  of  a  in  the  equation  r  =  1  +acos(50)  change  the  relative  sizes  of 
the  inner  and  outer  leaves?  Focus  on  values  a  >  1.  (Hint:  How  would  we  change  the  maximum  and 
minimum  values ?) 
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Exercise  14.2.19  Consider  the  integral 


D 


dA,  where  D  is  the  unit  disk  centered  at  the  origin. 


- \Jx2+y 2 

(a)  Why  might  this  integral  be  considered  improper? 

(b)  Calculate  the  value  of  the  integral  of  the  same  function  1  /  y/x2  +  y2  over  the  annulus  with  outer 
radius  1  and  inner  radius  8. 

(c)  Obtain  a  value  for  the  integral  on  the  whole  disk  by  letting  8  approach  0. 

(d)  For  which  values  X  can  we  replace  the  denominator  with  (x2  +  y2)^  in  the  original  integral? 


14.3  Moment  and  Center  of  Mass 


Using  a  single  integral  we  were  able  to  compute  the  center  of  mass  for  a  one-dimensional  object  with 
variable  density,  and  a  two  dimensional  object  with  constant  density.  With  a  double  integral  we  can  handle 
two  dimensions  and  variable  density. 

Just  as  before,  the  coordinates  of  the  center  of  mass  are 

_  My  _  Mx 

x  =  — -  y  =  — , 

M  M 

where  M  is  the  total  mass,  My  is  the  moment  around  the  y-axis,  and  Mx  is  the  moment  around  the  x-axis. 
(You  may  want  to  review  the  concepts  in  Section  8.6.) 

The  key  to  the  computation,  just  as  before,  is  the  approximation  of  mass.  In  the  two-dimensional  case, 
we  treat  density  <7  as  mass  per  square  area,  so  when  density  is  constant,  mass  is  (density) (area).  If  we 
have  a  two-dimensional  region  with  varying  density  given  by  o(x,y),  and  we  divide  the  region  into  small 
subregions  with  area  A A,  then  the  mass  of  one  subregion  is  approximately  <7(x;,y7)AA,  the  total  mass  is 
approximately  the  sum  of  many  of  these,  and  as  usual  the  sum  turns  into  an  integral  in  the  limit: 

fx\  ry i 

M  —  /  <j(x,y)dydx , 

J  xq  Jy  o 

and  similarly  for  computations  in  cylindrical  coordinates.  Then  as  before 

m  ry  i 

Mx=  /  yo(x,y)dydx 

•lx o  Jyo 
rx i  ry i 

My—  /  xo(x,y)dydx. 

Jx o  Jyo 


Example  14.7:  Center  of  Mass  of  Uniform  Plate 


Find  the  center  of  mass  of  a  thin,  uniform  plate  whose  shape  is  the  region  between  y  =  cosx  and  the 
x-axis  between  x  —  —  tt/2  and  x—  k/ 2. 
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Solution.  Since  the  density  is  constant,  we  may  take  o(x,y)  —  1. 

It  is  clear  that  x  —  0,  but  for  practice  let’s  compute  it  anyway.  First  we  compute  the  mass: 


M 


rTl/2  rCOSX 

-k/2  Jo 


1  dydx 


cosxdx  —  sinxl 


it/ 2 
—it/ 2 


2. 


Next, 


Finally, 


So  x  =  0  as  expected,  and  y  —  tt/4/2  =  7t/8.  This  is  the  same  problem  as  in  Example  8.21;  it  may  be 
helpful  to  compare  the  two  solutions.  & 


Example  14.8:  Center  of  Mass  of  2-D  Plate 


Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  quarter  circle  x2  +  y2  <  1  in 
the  first  quadrant  and  has  density  k(x2  +  y2). 


Solution.  It  seems  clear  that  because  of  the  symmetry  of  both  the  region  and  the  density  function  (both 
are  important!),  x  —  y.  We’ll  do  both  to  check  our  work. 

Jumping  right  in: 


M  = 


1  /•%/  l—x2 


0  JO 


r  1  - -  ( Y JC^)3/2 

k(x2 +y2)  dydx  —  k  r\/l-r2  +  - - - - dx. 


This  integral  is  something  we  can  do,  but  it’s  a  bit  unpleasant.  Since  everything  in  sight  is  related  to  a 
circle,  let’s  back  up  and  try  polar  coordinates.  Then  x2  +y2  —  r2  and 


/■7t/2  rl  [■ 7T/Z  p* 

M—  /  /  k(r2)rdrd0—k  — 

Jo  Jo  Jo  4 


71/ 2  r4 


1 


0 


f71!2  1  K 

d9=k  -d0  =  k—. 
Jo  4  8 


Much  better.  Next,  since  y  —  r  sin  0, 

f;r/2  j\ 


Mr  —  k 


r  sinOdrdO  —  k 


/  o  Jo 


flt/2  !  I 

/  -sind  dO  —  k  —  -cosO 
I  o  5  5 


Similarly, 


My  =  k 


ctz/2  ft 


r4  cos  9  dr  d6  =  k 


/o  Jo 


rn/2  l  l 

/  —  cos  9  d9  —  k  —  sin  9 

lo  5  5 


71/2  _k 
o 

71/2  _k_ 

o  _5‘ 


Finally,  x  —  y  —  — . 

jK 
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Exercises  for  14.3 


Exercise  14.3.1  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  square  [0, 1]  x  [0,1] 
and  has  density  function  xy. 

Exercise  14.3.2  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  triangle  0  <  x  <  1, 
0  <  y  <  x,  and  has  density  function  xy. 

Exercise  14.3.3  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  upper  unit  semicircle 
centered  at  (0,0)  and  has  density  function  y. 

Exercise  14.3.4  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  upper  unit  semicircle 
centered  at  (0,0)  and  has  density  function  x2. 

Exercise  14.3.5  Find  the  center  of  mass  of  a  two -dimensioned  plate  that  occupies  the  triangle  formed  by 
x  —  2  ,y  —  x,  and  y  =  2x  and  has  density  function  2.x. 

Exercise  14.3.6  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  triangle  formed  by 
x  —  0,  y  —  x,  and  2x  +  y  —  6  and  has  density  function  x  . 

Exercise  14.3.7  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  region  enclosed  by 
the  parabolas  x  —  y2,  y  —  x2  and  has  density  function  \fx. 

Exercise  14.3.8  Find  the  centroid  of  the  area  in  the  first  quadrant  bounded  by  x2  —  8y  + 4  =  0,  x2  =  Ay, 
and  x  —  0.  (Recall  that  the  centroid  is  the  center  of  mass  when  the  density  is  1  everywhere .) 

Exercise  14.3.9  Find  the  centroid  of  one  loop  of  the  three-leaf  rose  r  =  cos(30).  (Recall  that  the  centroid 
is  the  center  of  mass  when  the  density  is  1  everywhere,  and  that  the  mass  in  this  case  is  the  same  as  the 
area,  which  was  the  subject  of  Exercise  14.2.11  in  Section  14.2.)  The  computations  of  the  integrals  for  the 
moments  Mx  and  My  are  elementary  but  quite  long;  Sage  can  help. 

Exercise  14.3.10  Find  the  center  of  mass  of  a  two  dimensioned  object  that  occupies  the  region  0  <  x  <  K, 
0  <  y  <  sinx,  with  density  <7  =  1. 

Exercise  14.3.11  A  two-dimensional  object  has  shape  given  by  r  —  1  +  cos  0  and  density  (J  ( r,  6 )  =2  + 
cos  0.  Set  up  the  three  integrals  required  to  compute  the  center  of  mass. 

Exercise  14.3.12  A  two-dimensional  object  has  shape  given  by  r  —  cos  0  and  density  <j(r,  0)  =  r  +  1.  Set 
up  the  three  integrals  required  to  compute  the  center  of  mass. 

Exercise  14.3.13  A  two-dimensional  object  sits  inside  r  —  1  +cos  0  and  outside  r  —  cos  0,  and  has  density 
1  everywhere.  Set  up  the  integrals  required  to  compute  the  center  of  mass. 
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14.4  Surface  Area 


We  next  seek  to  compute  the  area  of  a  surface  above  (or  below)  a  region  in  the  x-y  plane.  How  might 
we  approximate  this?  We  start,  as  usual,  by  dividing  the  region  into  a  grid  of  small  rectangles.  We  want 
to  approximate  the  area  of  the  surface  above  one  of  these  small  rectangles.  The  area  is  very  close  to  the 
area  of  the  tangent  plane  above  the  small  rectangle.  If  the  tangent  plane  just  happened  to  be  horizontal, 
of  course  the  area  would  simply  be  the  area  of  the  rectangle.  For  a  typical  plane,  however,  the  area  is  the 
area  of  a  parallelogram,  as  indicated  in  Figure  14.11.  Note  that  the  area  of  the  parallelogram  is  obviously 
larger  the  more  “tilted”  the  tangent  plane. 


Figure  14.11:  Small  parallelograms  at  points  of  tangency. 

Now  recall  a  curious  fact:  the  area  of  a  parallelogram  can  be  computed  as  the  cross  product  of  two 
vectors.  We  simply  need  to  acquire  two  vectors,  parallel  to  the  sides  of  the  parallelogram  and  with  lengths 
to  match.  But  this  is  easy:  in  the  x  direction  we  use  the  tangent  vector  we  already  know,  namely  (1,0 ,fx) 
and  multiply  by  Ax  to  shrink  it  to  the  right  size:  (Ax,0,fxAx).  In  the  y  direction  we  do  the  same  thing  and 

get  (0,  Ay,  fyAy) .  The  cross  product  of  these  vectors  is  (fx,fy,  —  l)  AxAy  with  length  ^ +  1  Ax  Ay, 
the  area  of  the  parallelogram.  Now  we  add  these  up  and  take  the  limit,  to  produce  the  integral 

/  [  Jf?+fy+ldydx- 
Jy o  v 

As  before,  the  limits  need  not  be  constant. 


Solution.  We  compute  the  derivatives 


fx  = 


fx  = 
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and  then  the  area  is 


1  fVl—  X2  I  X 2  y2 

+  - - -o - o  +  1  dydx. 


J- 1  J-fuFF  y  1  —  x2  —  v2  1  —  x2  —  y2 

This  is  a  bit  on  the  messy  side,  but  we  can  use  polar  coordinates: 


f2n  rl  /  ^ 


rdrdO. 


I o  Jo  V  1  —  r2 

This  integral  is  improper,  since  the  function  is  undefined  at  the  limit  1 .  We  therefore  compute 


lim 

a-5’1'70 


n  [  \  - ~  rdr  —  lim  —  y/ 1  —  a2  +  1  =  1, 

l-  Jo  VI  ~rA  a-> t- 


using  the  substitution  u  =  1  —  r2.  Then  the  area  is 

[■2k 
JO 


1  dG  =  In. 


You  may  recall  that  the  area  of  a  sphere  of  radius  r  is  4;rr2,  so  half  the  area  of  a  unit  sphere  is  (1  /2)4tt  =  2tt, 


in  agreement  with  our  answer. 


X 


Exercises  for  14.4 


Exercise  14.4.1  Find  die  area  of  die  surface  of  a  right  circular  cone  of  height  li  and  base  radius  a. 

Exercise  14.4.2  Find  the  area  of  the  portion  of  the  plane  z  =  mx  inside  the  cylinder  x2  +y2  =  a2. 

Exercise  14.4.3  Find  the  area  of  the  portion  of  the  plane  x  +  y  +  z  —  1  in  the  first  octant. 

Exercise  14.4.4  Find  the  area  of  the  upper  half  of  the  cone  x2+y2  —  z2  inside  the  cylinder  x2  +y2  —  2x  = 

0. 

Exercise  14.4.5  Find  the  area  of  the  upper  half  of  the  cone  x2  -\-y2  =  z2  above  the  interior  of  one  loop  of 
r  —  cos(2  0). 

Exercise  14.4.6  Find  the  area  of  the  upper  hemisphere  of  x2  +  y2  +z2  =  1  above  the  interior  of  one  loop 
ofr  —  cos(2  9). 

Exercise  14.4.7  The  plane  ax  +  by  +  cz  —  d  cuts  a  triangle  in  the  first  octant  provided  that  a,b,c  and  d 
are  all  positive.  Find  the  area  of  this  triangle. 


Exercise  14.4.8  Find  the  area  of  the  portion  of  the  cone  x2  +y2  —  3 z2  lying  above  the  xy  plane  and  inside 
the  cylinder  x2  +y2  =  4y. 


512  Multiple  Integration 


14.5  Triple  Integrals 


It  will  come  as  no  surprise  that  we  can  also  do  triple  integrals — integrals  over  a  three-dimensional  region. 
The  simplest  application  allows  us  to  compute  volumes  in  an  alternate  way. 

To  approximate  a  volume  in  three  dimensions,  we  can  divide  the  three-dimensional  region  into  small 
rectangular  boxes,  each  Ax  x  Ay  x  Az  with  volume  AxAyAz.  Then  we  add  them  all  up  and  take  the  limit, 
to  get  an  integral: 

rx  i  ry  i  rzi 
/  /  /  dzdydx. 

Jxq  Jy0  Jzo 

Of  course,  if  the  limits  are  constant,  we  are  simply  computing  the  volume  of  a  rectangular  box. 


Example  14.10:  Volume  of  a  Box 


Compute  the  volume  of  the  box  with  opposite  corners  at  (0,0,0)  and  (1,2,3). 


Solution.  We  use  an  integral  to  compute  the  volume  of  the  box: 

m  dzdydx—  /  /  z\ndydx—  /  /  3 dvdx  —  /  3y\^,dx  =  /  6dx  —  6. 

Jo  Jo  '  Jo  Jo  '  Jo  "  Jo 


X 


Of  course,  this  is  more  interesting  and  useful  when  the  limits  are  not  constant. 


Example  14.11:  Volume  of  a  Tetrahedron 


Find  the  volume  of  the  tetrahedron  with  comers  at  (0,0,0),  (0,3,0),  (2,3,0),  and  (2,3,5). 


Solution.  The  whole  problem  comes  down  to  correctly  describing  the  region  by  inequalities:  0  <  x  <  2, 
3x/2  <y  <3,0  <z<  5x/2.  The  lowery  limit  comes  from  the  equation  of  the  liney  =  3x/2  that  forms  one 
edge  of  the  tetrahedron  in  the  x-y  plane;  the  upper  z  limit  comes  from  the  equation  of  the  plane  z  —  5x/2 
that  forms  the  “upper”  side  of  the  tetrahedron;  see  Figure  14.12.  Now  the  volume  is 


15jt 

15x3 

4 

12 

o 
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=  15-10  =  5. 


X 


Figure  14.12:  A  tetrahedron. 

Pretty  much  just  the  way  we  did  for  two  dimensions  we  can  use  triple  integration  to  compute  mass, 
center  of  mass,  and  various  average  quantities. 


Example  14.12:  Average  Temperature  in  a  Cube 


Suppose  the  temperature  at  a  point  is  given  by  T  —  xyz.  Find  the  average  temperature  in  the  cube 
with  opposite  corners  at  (0,0,0)  and  (2,2,2). 


Solution  In  two  dimensions  we  add  up  the  temperature  at  “each”  point  and  divide  by  the  area;  here  we 
add  up  the  temperatures  and  divide  by  the  volume,  8: 


i  r2  f 2  r2 

8 


/Z  [~  r xyzdzdydx=\  T 
Jo  Jo  Jo  8  Jo  Jo 


xyzr 


1  f2  r2 


dydx  =  — 

0  Io,/o  Jo 


1  f2  xy2 

4  Jo  ”2” 


dx  =  -  [  4 xdx  =  -  — 

8./0  2  2 


xydydx 
2  2 


=  1. 


0 


Example  14.13:  Mass  &  Center  of  Mass  of  Tetrahedron 


Suppose  the  density  of  an  object  is  given  by  xz,  and  the  object  occupies  the  tetrahedron  with  corners 
(0,0,0),  (0,1,0),  (1,1,0),  and  (0,1,1).  Find  the  mass  and  center  of  mass  of  the  object. 
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Solution  As  usual,  the  mass  is  the  integral  of  density  over  the  region: 

"1  flrtv-rl 2  1  /■!  JC(1  — JC)3 


m  =  r  r  rxzdzMx=  r 

Jo  Jx  Jo  Jo  Jx  2 

1 


o  3 


dx 


6  Jo 


x  —  3x2  +  3x3  —  x4  dx 


120 


We  compute  moments  as  before,  except  now  there  is  a  third  moment: 


xz~  dzdydx  = 


1 


360’ 


xy  zdzdydx  =  — — , 

144 


m.v-Jt  1 

x  zdzdydx —  — 
36 


360 

Finally,  the  coordinates  of  the  center  of  mass  are  x  =  Myz/M  —  1/3,  y  =  Mxz/M  —  5/6,  and  z  —  M^/M  = 
1/3.  ’  '  X 


Exercises  for  14.5 


mx+y 

2x  +  y—  l  dzdydx. 


r2  rx2  ry 

Exercise  14.5.2  Evaluate  /  /  /  xyzdzdy dx. 

Jo  J- 1  J  l 


mlny 

ex+y+z  dzdydx. 

. 


rTt/2  /-sin0  rr cos  6 

Exercise  14.5.4  Evaluate  /  /  /  r2dzdrdO. 

Jo  Jo  Jo 

rK  rsin  8  Msin0 

Exercise  14.5.5  Evaluate  /  /  /  rcos2  OdzdrdO. 

Jo  Jo  Jo 


r  i  ry  r*+y 

Exercise  14.5.6  Evaluate  /  /  /  xdzdxdy. 

Jo  Jo  Jo 


ny  /-In  (y+z) 

/  ex  dxdzdy. 

Jo 


r7l  /*7t/2  /*1 

Exercise  14.5.8  Compute  /  /  /  zsinx  +  zcosy dzdydx. 

Jo  Jo  Jo 
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Exercise  14.5.9  For  each  of  the  integrals  in  the  previous  exercises,  give  a  description  of  the  volume  ( both 
algebraic  and  geometric)  that  is  the  domain  of  integration. 


Exercise  14.5.10  Compute 


x  +  y  +  z  dV  over  the  region  inside  x2  A  y2  A  z2  <  1  in  the  first  octant. 


Exercise  14.5.11  Find  the  mass  of  a  cube  with  edge  length  2  and  density  equal  to  the  square  of  the 
distance  from  one  corner. 


Exercise  14.5.12  Find  the  mass  of  a  cube  with  edge  length  2  and  density  equal  to  the  square  of  the 
distance  from  one  edge. 

Exercise  14.5.13  An  object  occupies  the  volume  of  the  upper  hemisphere  of  x2  +y2  +  z2  —  4  and  has 
density  z.  at  (. x,y,z )■  Find  the  center  of  mass. 

Exercise  14.5.14  An  object  occupies  the  volume  of  the  pyramid  with  corners  at  (1,1,0),  (1,  — 1,0), 
(— 1,  — 1,0),  (  —  1, 1,0),  and  (0,0,2)  and  has  density  x2  Ay2  at  ( x,y,z ).  Find  the  center  of  mass. 


Exercise  14.5.15  Verify  the  moments  Mxy,  Mxz,  and  Myz  of  Example  14.13  by  evaluating  the  integrals. 


Exercise  14.5.16  Find  the  region  E  for  which 


z2)  dV  is  a  maximum. 


14.6  Cylindrical  and  Spherical  Coordinates 


We  have  seen  that  sometimes  double  integrals  are  simplified  by  doing  them  in  polar  coordinates;  not 
surprisingly,  triple  integrals  are  sometimes  simpler  in  cylindrical  coordinates  or  spherical  coordinates.  To 
set  up  integrals  in  polar  coordinates,  we  had  to  understand  the  shape  and  area  of  a  typical  small  region  into 
which  the  region  of  integration  was  divided.  We  need  to  do  the  same  thing  here,  for  three  dimensional 
regions. 

The  cylindrical  coordinate  system  is  the  simplest,  since  it  is  just  the  polar  coordinate  system  plus  a 
z  coordinate.  A  typical  small  unit  of  volume  is  the  shape  shown  in  Figure  14.7  “fattened  up”  in  the  z 
direction,  so  its  volume  is  rArAOAz,  or  in  the  limit,  rdrdOdz. 


Example  14.14:  Finding  Volume 


Find  the  volume  under  z  —  V 4  —  r2  above  the  quarter  circle  inside x2  Ay2  —  4  in  the  first  quadrant. 


Solution.  We  could  of  course  do  this  with  a  double  integral,  but  we’ll  use  a  triple  integral: 


p7t/ 2  r2  p\j 4 — p2  _ 

/  /  /  rdzdrdO  —  /  /  y  4  —  r2rdrd6 

Jo  Jo  Jo  Jo  Jo 


4k 

T' 


X 


Compare  this  to  Example  14.5. 
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Example  14.15:  Mass  using  Cylindrical  Coordinates 


An  object  occupies  the  space  inside  both  the  cylinder  x2  +  y2  =  1  and  the  sphere  x2  +y2  +z2  =  4, 
and  has  density  x2  at  ( x,y,z ).  Find  the  total  mass. 


Solution  We  set  this  up  in  cylindrical  coordinates,  recalling  that  x  =  r  cos  6  : 

r2n  r\ 


r2K  /■!  r\j4—r2 


0  Jo  ,/-vCr 


r  cos* (Q)dzdrdO  — 


4  —  r2  r3  cos2(6) drdO 

o  JO 

r271  f  128  22 


/  o 


— —  \/3  )  cos2(0)r/0 


128  22  r- , 


X 

Spherical  coordinates  are  somewhat  more  difficult  to  understand.  The  small  volume  we  want  will  be 
defined  by  A p,  A (j),  and  A0,  as  pictured  in  Figure  14.13.  The  small  volume  is  nearly  box  shaped,  with  4 
flat  sides  and  two  sides  formed  from  bits  of  concentric  spheres.  When  Ap,  A  </>,  and  AO  are  all  very  small, 
the  volume  of  this  little  region  will  be  nearly  the  volume  we  get  by  treating  it  as  a  box.  One  dimension  of 
the  box  is  simply  Ap,  the  change  in  distance  from  the  origin.  The  other  two  dimensions  are  the  lengths  of 
small  circular  arcs,  so  they  are  rAoc  for  some  suitable  r  and  a,  just  as  in  the  polar  coordinates  case. 


Figure  14.13:  A  small  unit  of  volume  for  spherical  coordinates. 


The  easiest  of  these  to  understand  is  the  arc  corresponding  to  a  change  in  <j> ,  which  is  nearly  identical 
to  the  derivation  for  polar  coordinates,  as  shown  in  the  left  graph  in  Figure  14.14.  In  that  graph  we  are 
looking  “face  on”  at  the  side  of  the  box  we  are  interested  in,  so  the  small  angle  pictured  is  precisely  A<j>, 
the  vertical  axis  really  is  the  z  axis,  but  the  horizontal  axis  is  not  a  real  axis — it  is  just  some  line  in  the  x-y 
plane.  Because  the  other  arc  is  governed  by  0,  we  need  to  imagine  looking  straight  down  the  z  axis,  so 
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that  the  apparent  angle  we  see  is  AO.  In  this  view,  the  axes  really  are  the  x  and  y  axes.  In  this  graph,  the 
apparent  distance  from  the  origin  is  not  p  but  psintj),  as  indicated  in  the  left  graph. 

z  y 


Figure  14.14:  Setting  up  integration  in  spherical  coordinates. 


The  upshot  is  that  the  volume  of  the  little  box  is  approximately  Ap(pA0)(p  sin  PAO)  =  p2  sin  pApApAO. 
or  in  the  limit  p2  sin  (j)  dp  d(j)  dO. 


Example  14.16:  Average  Temperature  in  a  Unit  Sphere 


Suppose  the  temperature  at  (x,y,z)  isT  —  1/(1  +„t2  +_v2  +  z2).  Find  the  average  temperature  in  the 
unit  sphere  centered  at  the  origin. 


Solution.  In  two  dimensions  we  add  up  the  temperature  at  “each”  point  and  divide  by  the  area;  here  we 
add  up  the  temperatures  and  divide  by  the  volume,  (4/3)  tt: 


2  r  1  rVl—x2  r\J  l—x2—y2  J 

te L J  v,  v  l  +V dzdydx 

This  looks  quite  messy;  since  everything  in  the  problem  is  closely  related  to  a  sphere,  we’ll  convert  to 
spherical  coordinates. 


3 

Ak 


i  2 

^p  sin (jtdpdtffdO 


3 

4  K 


(4  7Z-7Z2) 


X 


Exercises  for  14.6 


r i  rx  rVx +y~  (x2+y2)3/2 

Exercise  14.6.1  Evaluate  /  /  /  —5 - ~ ~  dzdydx . 

Jo  Jo  Jo  x~+yz  +  zz 
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ft  r  ■/ 1  — X2  ry/l— .V2—  y2 


Exercise  14.6.2  Evaluate  I  I  I  _  -\A2  +  y2  +  z2  dz  dv  dx. 

J 0  J \/x1+y1 


Exercise  14.6.3  Evaluate 
z  =  5. 


over  t/re  interior  of  the  cylinder  x2  +  y2  =  1  between  z  —  0  and 


Exercise  14.6.4  Evaluate 
z  —  5. 


.rye/ 17  over  //re  interior  of  the  cylinder  x2  +  y2  =  1  between  z  =  0  and 


Exercise  14.6.5  Evaluate 
under  x2  +  y2  +z2  —  4. 

Exercise  14.6.6  Evaluate 
under  x2  +  y2  +  z2  =  4. 


zdV  over  the  region  above  the  x-y  plane,  inside  x2  +y2  —  2x  —  0  and 


yzdV  over  the  region  in  the  first  octant,  inside  x2  +y2  —  2x  —  0  and 


Exercise  14.6.7  Evaluate  I  I  I  x2  +y2  dV  over  the  interior  of  xL  +yl  +  z1  —  4. 


2  ,  .,2  ,  _2 


Exercise  14.6.8  Evaluate  III  \/ x2  +y2  dV  over  the  interior  of  x2  +  y2  +  z2  —  4. 


Exercise  14.6.9  Compute  j  J  j  x  +  y  +  zdV  over  the  region  inside  x2  +y2  +z2  =  1  in  the  first  octant. 

Exercise  14.6.10  Find  the  mass  of  a  right  circular  cone  of  height  h  and  base  radius  a  if  the  density  is 
proportional  to  the  distance  from  the  base. 

Exercise  14.6.11  Find  the  mass  of  a  right  circular  cone  of  height  h  and  base  radius  a  if  the  density  is 
proportioned  to  the  distance  from  its  axis  of  symmetry. 

Exercise  14.6.12  An  object  occupies  the  region  inside  the  unit  sphere  at  the  origin,  and  has  density  equal 
to  the  distance  from  the  x-axis.  Find  the  mass. 

Exercise  14.6.13  An  object  occupies  the  region  inside  the  unit  sphere  at  the  origin,  and  has  density  equal 
to  the  square  of  the  distance  from  the  origin.  Find  the  mass. 

Exercise  14.6.14  An  object  occupies  the  region  between  the  unit  sphere  at  the  origin  and  a  sphere  of 
radius  2  with  center  at  the  origin,  and  has  density  equal  to  the  distance  from  the  origin.  Find  the  mass. 

Exercise  14.6.15  An  object  occupies  the  region  in  the  first  octant  bounded  by  the  cones  <b  —  Jl/4  and 
(j)  =  arctan2,  and  the  sphere  p  =  \/6,  and  has  density  proportional  to  the  distance  from  the  origin.  Find 
the  mass. 
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14.7  Change  of  Variables 


One  of  the  most  useful  techniques  for  evaluating  integrals  is  substitution,  both  “w-substitution”  and  trigono¬ 
metric  substitution,  in  which  we  change  the  variable  to  something  more  convenient.  As  we  have  seen, 
sometimes  changing  from  rectangular  coordinates  to  another  coordinate  system  is  helpful,  and  this  too 
changes  the  variables.  This  is  certainly  a  more  complicated  change,  since  instead  of  changing  one  variable 
for  another  we  change  an  entire  suite  of  variables,  but  as  it  turns  out  it  is  really  very  similar  to  the  kinds  of 
change  of  variables  we  already  know  as  substitution. 


Figure  14.15:  Single  change  of  variable. 


Let’s  examine  the  single  variable  case  again,  from  a  slightly  different  perspective  than  we  have  previ¬ 
ously  used.  Suppose  we  start  with  the  problem 

/  x2 \j  1  —  x2  dx; 

Jo 

this  computes  the  area  in  the  left  graph  of  Figure  14.15.  We  use  the  substitution  x  =  sin  m  to  transform  the 
function  from  x2\/l  —  x2  to  sin2 u  VT  —  sin2  u,  and  we  also  convert  dx  to  cos udu.  Finally,  we  convert  the 


limits  0  and  1  to  0  and  n/ 2.  This  transforms  the  integral: 


k2  \J  1  —  x2  dx  — 


ck/2 


sin2  u 


\J  1  —  sin2  ucosudu. 


We  want  to  notice  that  there  are  three  different  conversions:  the  main  function,  the  differential  dx ,  and  the 
interval  of  integration.  The  function  is  converted  to  sin2w\/ 1  —  sin2w,  shown  in  the  right-hand  graph  of 
Figure  14.15.  It  is  evident  that  the  two  curves  pictured  there  have  the  same  y-values  in  the  same  order,  but 
the  horizontal  scale  has  been  changed.  Even  though  the  heights  are  the  same,  the  two  integrals 

[  x 2v/ 1  —x2dx  and  /  sin 2  u\/ 1  —  sin 2  udu 

Jo  Jo 


are  not  the  same;  clearly  the  right  hand  area  is  larger.  One  way  to  understand  the  problem  is  to  note  that 
if  both  areas  are  approximated  using,  say,  ten  subintervals,  that  the  approximating  rectangles  on  the  right 
are  wider  than  their  counterparts  on  the  left,  as  indicated.  In  the  picture,  the  width  of  the  rectangle  on  the 
left  is  Ax'  =  0.1,  between  0.7  and  0.8.  The  rectangle  on  the  right  is  situated  between  the  corresponding 
values  arcsin(0.7)  and  arcsin(0.8)  so  that  A u  =  arcsin(0.8)  —  arcsin(0.7).  To  make  the  widths  match,  and 
the  areas  therefore  the  same,  we  can  multiply  A u  by  a  correction  factor;  in  this  case  the  correction  factor 
is  approximately  cosw  =  cos(arcsin(0.7)),  which  we  compute  when  we  convert  dx  to  cos  udu. 
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Now  let’s  move  to  functions  of  two  variables.  Suppose  we  want  to  convert  an  integral 


rx i  ryi 

/  /  f(x,y)dydx 

Jxq  -ho 


to  use  new  variables  u  and  v.  In  the  single  variable  case,  there’s  typically  just  one  reason  to  want  to  change 
the  variable:  to  make  the  function  “nicer”  so  that  we  can  find  an  antiderivative.  In  the  two  variable  case, 
there  is  a  second  potential  reason:  the  two-dimensional  region  over  which  we  need  to  integrate  is  somehow 
unpleasant,  and  we  want  the  region  in  terms  of  u  and  v  to  be  nicer — to  be  a  rectangle,  for  example.  Ideally, 
of  course,  the  new  function  and  the  new  region  will  be  no  worse  than  the  originals,  and  at  least  one  of 
them  will  be  better;  this  doesn’t  always  pan  out. 

As  before,  there  are  three  parts  to  the  conversion:  the  function  itself  must  be  rewritten  in  terms  of  u 
and  v,  dydx  must  be  converted  to  dudv,  and  the  old  region  must  be  converted  to  the  new  region.  We  will 
develop  the  necessary  techniques  by  considering  a  particular  example,  and  we  will  use  an  example  we 
already  know  how  to  do  by  other  means. 

Consider 


The  limits  correspond  to  integrating  over  the  top  half  of  a  circular  disk,  and  we  recognize  that  the  function 
will  simplify  in  polar  coordinates,  so  we  would  normally  convert  to  polar  coordinates: 


K 

3' 


But  let’s  instead  approach  this  as  a  substitution  problem,  starting  with  x  —  r cos  0,  y  =  rsin  0.  This  pair  of 
equations  describes  a  function  from  “r-6  space”  to  “x-y  space”,  and  because  it  involves  familiar  concepts, 
it  is  not  too  hard  to  understand  what  it  does.  In  Figure  14.16  we  have  indicated  geometrically  a  bit  about 
how  this  function  behaves.  The  four  dots  labeled  a-d  in  the  r-6  plane  correspond  to  the  three  dots  in  the 
x-y  plane;  dots  a  and  b  both  go  to  the  origin  because  r  =  0.  The  horizontal  arrow  in  the  r-6  plane  has 
r  =  1  everywhere  and  6  ranges  from  0  to  n,  so  the  corresponding  points  x  —  rcosO,  y  —  rsin 6  start  at 
(1,0)  and  follow  the  unit  circle  counter-clockwise.  Finally,  the  vertical  arrow  has  6  =  7t/4  and  r  ranges 
from  0  to  1,  so  it  maps  to  the  straight  arrow  in  the  x-y  plane.  Extrapolating  from  these  few  examples,  it’s 
not  hard  to  see  that  every  vertical  line  in  the  r-6  plane  is  transformed  to  a  line  through  the  origin  in  the  x-y 
plane,  and  every  horizontal  line  in  the  r-6  plane  is  transformed  to  a  circle  with  center  at  the  origin  in  the 
x-y  plane.  Since  we  are  interested  in  integrating  over  the  half-disk  in  the  x-y  plane,  we  will  integrate  over 
the  rectangle  [0,7r]  x  [0, 1]  in  the  r-6  plane,  because  we  now  see  that  the  points  in  this  rectangle  are  sent 
precisely  to  the  upper  half  disk  by  x  —  r cos  6  and  y  =  rsin  0. 


Figure  14.16:  Double  change  of  variable. 
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At  this  point  we  are  two-thirds  done  with  the  task:  we  know  the  r-0  limits  of  integration,  and  we  can 
easily  convert  the  function  to  the  new  variables: 

\fx2  -\-y2  —  \J r 2  cos2  0  +  r2  sin2  0  —  r\/ cos2  0  +  sin2  0  —  r.  (14.1) 


The  final,  and  most  difficult,  task  is  to  figure  out  what  replaces  dxdy.  (Of  course,  we  actually  know  the 
answer,  because  we  are  in  effect  converting  to  polar  coordinates.  What  we  really  want  is  a  series  of  steps 
that  gets  to  that  right  answer  but  that  will  also  work  for  other  substitutions  that  are  not  so  familiar.) 

Let’s  take  a  step  back  and  remember  how  integration  arises  from  approximation.  When  we  approxi¬ 
mate  the  integral  in  the  x-y  plane,  we  are  computing  the  volumes  of  tall  thin  boxes,  in  this  case  boxes  that 
are  Ax  x  Ay  x  \J x2  +  y2.  We  are  aiming  to  come  up  with  an  integral  in  the  r-0  plane  that  looks  like  this: 

r  U-OdrdO.  (14.2) 

Jo  Jo 

What  we’re  missing  is  exactly  the  right  quantity  to  replace  the  “?”  so  that  we  get  the  correct  answer.  Of 
course,  this  integral  is  also  the  result  of  an  approximation,  in  which  we  add  up  volumes  of  boxes  that  are 
Ar  x  AO  x  height;  the  problem  is  that  the  height  that  will  give  us  the  correct  answer  is  not  simply  r.  Or  put 
another  way,  we  can  think  of  the  correct  height  as  r,  but  the  area  of  the  base  ArAO  as  being  wrong.  The 
height  r  comes  from  Equation  14.1,  which  is  to  say,  it  is  precisely  the  same  as  the  corresponding  height  in 
the  x-y  version  of  the  integral.  The  problem  is  that  the  area  of  the  base  Ar  x  Ay  is  not  the  same  as  the  area 
of  the  base  Arx  AO.  We  can  think  of  the  “?”  in  the  integral  as  a  correction  factor  that  is  needed  so  that 
IdrdQ  =  dxdy. 

So  let’s  think  about  what  that  little  base  Ar  x  AO  corresponds  to.  We  know  that  each  bit  of  horizontal 
line  in  the  r-0  plane  corresponds  to  a  bit  of  circular  arc  in  the  x-y  plane,  and  each  bit  of  vertical  line  in  the 
r-0  plane  corresponds  to  a  bit  of  “radial  line”  in  the  x-y  plane.  In  Figure  14. 17  we  show  a  typical  rectangle 
in  the  r-0  plane  and  its  corresponding  area  in  the  x-y  plane. 


r 


1  - 


0 


Ar 


AO 


- r  0  ~r 

Figure  14.17:  Corresponding  areas. 
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In  this  case,  the  region  in  the  x-y  plane  is  approximately  a  rectangle  with  dimensions  Ar  x  rAO,  but  in 
general  the  comer  angles  will  not  be  right  angles,  so  the  region  will  typically  be  (almost)  a  parallelogram. 
We  need  to  compute  the  area  of  this  parallelogram.  We  know  a  neat  way  to  do  this:  compute  the  length  of 
a  certain  cross  product.  If  we  can  determine  an  appropriate  two  vectors  we’ll  be  nearly  done. 

Fortunately,  we’ve  really  done  this  before.  The  sides  of  the  region  in  the  x-y  plane  are  formed  by  tem¬ 
porarily  fixing  either  r  or  0  and  letting  the  other  variable  range  over  a  small  interval.  In  Figure  14.17,  for 
example,  the  upper  right  edge  of  the  region  is  formed  by  fixing  0  =  2tt / 3  and  letting  r  run  from  0.5  to  0.75. 
In  other  words,  we  have  a  vector  function  v(r)  =  (r cos  0o,  rsin  0o,  0) ,  and  we  are  interested  in  a  restricted 
set  of  values  for  r.  A  vector  tangent  to  this  path  is  given  by  the  derivative  v'(r)  =  (cos  0O, sin  0q,O),  and 
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a  small  tangent  vector,  with  length  approximately  equal  to  the  side  of  the  region,  is  (cos  0O, sin  0o,O)  dr. 
Likewise,  if  we  fix  r  —  ro  =  0.5,  we  get  the  vector  function  w (0)  =  (ro cos 0, rosin 0,0)  with  derivative 
w(0)  =  (— rosin 0,ro cos 0,0)  and  a  small  tangent  vector  (— rosin 0o,ro cos Oo,0)dO  when  G  —  Go  (at  the 
comer  we’re  focusing  on).  These  vectors  are  shown  in  Figure  14.18,  with  the  actual  region  outlined  by  a 
dotted  boundary.  Of  course,  since  both  A r  and  A0  are  quite  large,  the  parallelogram  is  not  a  particularly 
good  approximation  to  the  true  area. 


“ i  i-  A 

-1  0  1 

Figure  14.18:  The  approximating  parallelogram. 


The  area  of  this  parallelogram  is  the  length  of  the  cross  product: 


(— rosin  0o,  ro  cos  0q,O)  r/0  x  (cos  0o,sin0o,O)  dr 


i 

— rosin0o 
cos  0o 


j 

rocos  0o 
sin  0o 


k 

0 

0 


dOdr 


(0, 0,  —  ro  sin2  Gq  —  ro  cos2  Gq)  dG  dr 


(0,0,—ro)dQdr. 


The  length  of  this  vector  is  rodrdG.  So  in  general,  for  any  values  of  r  and  0,  the  area  in  the  x-y  plane 
corresponding  to  a  small  rectangle  anchored  at  (0,r)  in  the  r-G  plane  is  approximately  rdrdG.  In  other 
words,  “r”  replaces  the  “?”  in  Equation  14.2. 

In  general,  a  substitution  will  start  with  equations  jc  =  f(u,  v)  and  y  —  g(u ,  v).  Again,  it  will  be  straight¬ 
forward  to  convert  the  function  being  integrated.  Converting  the  limits  will  require,  as  above,  an  under¬ 
standing  of  just  how  the  functions  /  and  g  transform  the  u-v  plane  into  the  x-y  plane.  Finally,  the  small 
vectors  we  need  to  approximate  an  area  will  be  {fu,gu,  0)  du  and  (f,,gv,  0)  dv.  The  cross  product  of  these 
is  (0,0  Jugv  - gufv)  dudv  with  length  \  fugv  -  gufv\dudv.  The  quantity  |  fugv  -  gufv\  is  usually  denoted 


d(s,y) 

d(u,v) 


I  fugv  gufv 


and  called  the  Jacobian.  Note  that  this  is  the  absolute  value  of  the  two  by  two  determinant 


Ju  gu 
fv  gv 


which  may  be  easier  to  remember.  (Confusingly,  the  matrix,  the  determinant  of  the  matrix,  and  the  absolute 
value  of  the  determinant  are  all  called  the  Jacobian  by  various  authors.) 

Because  there  are  two  things  to  worry  about,  namely,  the  form  of  the  function  and  the  region  of 
integration,  transformations  in  two  (or  more)  variables  are  quite  tricky  to  discover. 
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Example  14.17:  Integral  of  an  Ellipse 


Integrate  x2  —  xy  +  y2  over  the  region  x2  -xy-\-y2  <  2. 


Solution.  The  equation  x2  —  xy  -\-y2  —  2  describes  an  ellipse  as  in  Figure  14.19;  the  region  of  integration 
is  the  interior  of  the  ellipse.  We  will  use  the  transformation  x  —  \[2u  —  a/2/3v,  y  —  \f2u  +  a/ 2/3v. 
Substituting  into  the  function  itself  we  get 

x2  —  xv  +  y2  —  2  u2  +  2v2. 

The  boundary  of  the  ellipse  is  x2  —  xy  +  y2  =2,  so  the  boundary  of  the  corresponding  region  in  the  u-v 
plane  is  2 u2  +  2v2  =  2  or  n2  +  v2  =  1,  the  unit  circle,  so  this  substitution  makes  the  region  of  integration 
simpler. 

Next,  we  compute  the  Jacobian,  using  /  =  \[2u  —  a/ 2/3v  and  g  =  y/2 u  +  \j2j2v. 

fugv-gufv  =  V2^2j3  +  V2^2j3  = 

Hence  the  new  integral  is 

JJ  (2u2  +  2v2)— —dudv, 

where  R  is  the  interior  of  the  unit  circle.  This  is  still  not  an  easy  integral,  but  it  is  easily  transformed  to 
polar  coordinates,  and  then  easily  integrated. 


Figure  14.19:  x2  —  xy  +  y2  —  2 


There  is  a  similar  change  of  variables  formula  for  triple  integrals,  though  it  is  a  bit  more  difficult  to 
derive.  Suppose  we  use  three  substitution  functions,  x  —  f(u,v,w),  y  —  g(u,v,w),  and  z  =  h(u,v,w).  The 
Jacobian  determinant  is  now 

fu  gu  hu 
fv  gv  K  . 
fw  gw  hw 

Then  the  integral  is  transformed  in  a  similar  fashion: 


d(x,y,z) 

d(u,v,w) 


1 1 J F(f(u,v,w),g(u,v,w),h(u,v,w)) 


d(x,y,z ) 
d(u,v,w) 


where  of  course  the  region  S  in  uvw  space  corresponds  to  the  region  R  in  xyz  space. 


dudvdw. 


524  Multiple  Integration 


Exercises  for  14.7 


Exercise  14.7.1  Complete  Example  14.17  by  converting  to  polar  coordinates  and  evaluating  the  integral. 


Exercise  14.7.2  Evaluate  j J  xydxdy  over  the  square  with  corners  (0,0),  (1, 1),  (2,0),  and  (1,-1)  in  two 
ways:  directly ,  and  using  x  —  (u  +  v)/2,  y  —  (u  —  v)/2. 

Exercise  14.7.3  Evaluate  JJ x2  +y2dxdy  over  the  square  with  corners  (  —  1,0),  (0, 1),  (1,0),  and  (0,-1) 
in  two  ways:  directly,  and  using  x  =  (u  +  v)/2,  y  —  (u  —  v)/2. 

Exercise  14.7.4  Evaluate  JJ (x  +  y)ex~y  dxdy  over  the  triangle  with  corners  (0,0),  (  —  1, 1),  and  (1, 1)  in 
two  ways:  directly,  and  using  x  —  (u  +  v)/2,  y  —  (u  —  v)/2. 


Exercise  14.7.5  Evaluate  J  J  y(x  —  y)  dxdy  over  the  parallelogram  with  corners  (0,0),  (3,3),  (7,3),  and 
(4, 0)  in  two  ways:  directly,  and  using  x  —  u  +  v,  y  —  u. 


Exercise  14.7.6  Evaluate  J  J  \J x2  Ey2  dxdy  over  the  triangle  with  corners  (0,0),  (4,4),  and  (4,0)  using 
x  —  u,  y  —  uv. 

Exercise  14.7.7  Evaluate  JJ  ysin(xy)  dxdy  over  the  region  bounded  by  xy  —  xy  —  4,  y  —  1,  and  y  —  4 
using  x  =  u/v,  y  =  v. 

Exercise  14.7.8  Evcducite  JJ  sin(9x2  +  4 v2 )  dA,  over  the  region  in  the  first  quadrant  bounded  by  the 
ellipse  9x2  4-  4v2  =  1. 

Exercise  14.7.9  Compute  the  Jacobian  for  the  substitutions  x  =  p  sin  (j)  cos  9,  y  —  p  sin  (j)  sin  9,  z  —  p  cos  (j). 


Exercise  14.7.10  Evaluate  /  /  /  dV  where  E  is  the  solid  enclosed  by  the  ellipsoid 


2^2 
y~  z 

~ o  +  To  H - o  — 


a 


2  1  b2 


using  the  transformation  x  —  au,  y  —  bv,  and  z  =  cw. 


15.  Vector  Functions 


15.1  Curves  of  Vector  Functions 


We  have  already  seen  that  a  convenient  way  to  describe  a  line  in  three  dimensions  is  to  provide  a  vector 
that  “points  to”  every  point  on  the  line  as  a  parameter  t  varies,  like 

(1,2,3)  +f(l,— 2,2)  =  (1  +f,2  — 2f,3  +  2f). 

Except  that  this  gives  a  particularly  simple  geometric  object,  there  is  nothing  special  about  the  indi¬ 
vidual  functions  of  t  that  make  up  the  coordinates  of  this  vector — any  vector  with  a  parameter,  like 
if(t)’g(t),h(t)),  will  describe  some  curve  in  three  dimensions  as  t  varies  through  all  possible  values. 


Solution.  As  t  varies,  the  first  two  coordinates  in  all  three  functions  trace  out  the  points  on  the  unit  circle, 
starting  with  (1,0)  when  t  —  0  and  proceeding  counter-clockwise  around  the  circle  as  t  increases.  In  the 
first  case,  the  z  coordinate  is  always  0,  so  this  describes  precisely  the  unit  circle  in  the  x-y  plane.  In  the 
second  case,  the  x  and  y  coordinates  still  describe  a  circle,  but  now  the  z  coordinate  varies,  so  that  the 
height  of  the  curve  matches  the  value  of  t.  When  t  =  n,  for  example,  the  resulting  vector  is  (— 1,0.  n). 
A  bit  of  thought  should  convince  you  that  the  result  is  a  helix.  In  the  third  vector,  the  z  coordinate  varies 
twice  as  fast  as  the  parameter  t,  so  we  get  a  stretched  out  helix.  Both  are  shown  in  Figure  15.1.  On  the  left 
is  the  first  helix,  shown  for  1  between  0  and  47t;  on  the  right  is  the  second  helix,  shown  for  t  between  0  and 
2k.  Both  start  and  end  at  the  same  point,  but  the  first  helix  takes  two  full  “turns”  to  get  there,  because  its 
z  coordinate  grows  more  slowly.  X 
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Figure  15.1:  Two  helixes. 

A  vector  expression  of  the  form  r (t)  —  ( f(t),g(t),h(t ))  is  called  a  vector  function;  it  is  a  function 
from  the  real  numbers  M  to  the  set  of  all  three-dimensional  vectors.  We  can  alternately  think  of  it  as  three 
separate  functions,  x  =  f(t),  y  =  g(t),  and  z  —  h(t),  that  describe  points  in  space.  In  this  case  we  usually 
refer  to  the  set  of  equations  as  parametric  equations  for  the  curve,  just  as  for  a  line.  While  the  parameter 
t  in  a  vector  function  might  represent  any  one  of  a  number  of  physical  quantities,  or  be  simply  a  “pure 
number”,  it  is  often  convenient  and  useful  to  think  of  t  as  representing  time.  The  vector  function  then  tells 
you  where  in  space  a  particular  object  is  at  any  time. 

Vector  functions  can  be  difficult  to  understand,  that  is,  difficult  to  picture.  When  available,  computer 
software  can  be  very  helpful.  When  working  by  hand,  one  useful  approach  is  to  consider  the  “projections” 
of  the  curve  onto  the  three  standard  coordinate  planes.  We  have  already  done  this  in  part:  in  Example  15.1 
we  noted  that  all  three  curves  project  to  a  circle  in  the  x-y  plane,  since  (cos/, sin/}  is  a  two  dimensional 
vector  function  for  the  unit  circle. 


Example  15.2:  Projections  onto  a  Plane 


Graph  the  projections  of  (cost,  sin/,  2t)  onto  the  x-z  plane  and  they-z  plane. 


Solution.  The  two  dimensional  vector  function  for  the  projection  onto  the  x-z  plane  is  (cos/,2t),  or  in 
parametric  form,  x  =  cost,  z  =  2 1.  By  eliminating  t  we  get  the  equation  x  =  cos(z/2),  the  familiar  curve 
shown  on  the  left  in  Figure  15.2.  For  the  projection  onto  the  y-z  plane,  we  start  with  the  vector  function 
(sint,2r),  which  is  the  same  as  y  —  sinf,  z  —  2 1.  Eliminating  t  gives  y  —  sin(z/2),  as  shown  on  the  right  in 
Figure  15.2.  £ 
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Figure  15.2:  The  projections  of  (cos t,  sin t.  2t)  onto  the  x-z  and  y-z  planes. 


Exercises  for  15.1 


Exercise  15.1.1  Investigate  the  cun’e  r(t)  —  (sin/, cost, cos  8/). 

Exercise  15.1.2  Investigate  the  cur\’e  r(t )  =  (/cos/, /sin/,/). 

Exercise  15.1.3  Investigate  the  cur\’e  r(t)  =  ( t,t2,cost ). 

Exercise  15.1.4  Investigate  the  cur\’e  r(t )  =  (cos(20z)\/l  —  t2,  sin(20/)v/l  —t2,t) 

Exercise  15.1.5  Find  a  vector  function  for  the  curve  of  intersection  ofx2+y2  —  9  andy  +  z  —  2. 

Exercise  15.1.6  A  bug  is  crawling  outward  along  the  spoke  of  a  wheel  that  lies  along  a  radius  of  the 
wheel.  The  bug  is  crawling  at  1  unit  per  second  and  the  wheel  is  rotating  at  1  radian  per  second.  Suppose 
the  wheel  lies  in  the  y-z  plane  with  center  at  the  origin,  and  at  time  t  —  0  the  spoke  lies  along  the  positive 
y  axis  and  the  bug  is  at  the  origin.  Find  a  vector  function  r(t)  for  the  position  of  the  bug  at  time  t. 

Exercise  15.1.7  What  is  the  difference  between  the  parametric  curves  /(/)  —  ( /,/,/ 2),  g(t)  —  (t2,t2,t4), 
and  h(t )  =  (sin(/),sin(/),sin2(/))n,5' t  runs  over  all  real  numbers? 

Exercise  15.1.8  Plot  each  of  the  curves  below  in  2  dimensions,  projected  onto  each  of  the  three  standard 
planes  ( the  x-y,  x-z,  and  y-z  planes). 

(a)  /(/)  =  (/,/3,/2),  t  ranges  over  all  real  numbers 

(b)  f(t)  —  (t2,t —  l,t2 +  5)  forO  <t  <3 
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Exercise  15.1.9  Given  points  A  =  (01,02,0.3)  and  B  =  (b\,b2,bf,  give  parametric  equations  for  the  line 
segment  connecting  A  and  B.  Be  sure  to  give  appropriate  t  values. 

Exercise  15.1.10  With  a  parametric  plot  and  a  set  oft  values,  we  can  associate  a  ‘direction  ’.  For  exam¬ 
ple,  the  curve  (cos/,  sin?)  is  the  unit  circle  traced  counterclockwise.  How  can  we  amend  a  set  of  given 
parametric  equations  and  t  values  to  get  the  same  curve,  only  traced  backwards? 


15.2  Calculus  with  Vector  Functions 


A  vector  function  r (t)  —  (f(t),g(t),h(t))  is  a  function  of  one  variable — that  is,  there  is  only  one  “input” 
value.  What  makes  vector  functions  more  complicated  than  the  functions  y  =  f(x)  that  we  studied  in  the 
first  part  of  this  book  is  of  course  that  the  “output”  values  are  now  three-dimensional  vectors  instead  of 
simply  numbers.  It  is  natural  to  wonder  if  there  is  a  corresponding  notion  of  derivative  for  vector  functions. 
In  the  simpler  case  of  a  function  y  =  s(t),  in  which  t  represents  time  and  s(t)  is  position  on  a  line,  we  have 
seen  that  the  derivative  s\t)  represents  velocity;  we  might  hope  that  in  a  similar  way  the  derivative  of  a 
vector  function  would  tell  us  something  about  the  velocity  of  an  object  moving  in  three  dimensions. 

One  way  to  approach  the  question  of  the  derivative  for  vector  functions  is  to  write  down  an  expression 
that  is  analogous  to  the  derivative  we  already  understand,  and  see  if  we  can  make  sense  of  it.  If  we  say 
that  what  we  mean  by  the  limit  of  a  vector  is  the  vector  of  the  individual  coordinate  limits,  this  gives  us 


r(t  +  At)  —  rjt) 

At 

(f(t  +  At)  -  f(t),g(t  +  At)  -  g(t),h(t  +  At)  -  h(t)) 

At 

__  y  ,f{t A- At) -f{t)  gjt  +  At)  - g(t )  hit  +  At)- hit) 
a,"!a  At  ’  At  ’  At 
=  {f'{tU'{t\h'it)), 


r'(t)  =  lim 
Af— >0 


=  lim 

A/->0 


Starting  with  a  familiar  expression  for  what  appears  to  be  a  derivative,  we  find  that  we  can  make  good 
computational  sense  out  of  it — but  what  does  it  actually  mean? 

We  know  how  to  interpret  r(f  +  At)  and  r(/) — they  are  vectors  that  point  to  locations  in  space;  if  t  is 
time,  we  can  think  of  these  points  as  positions  of  a  moving  object  at  times  that  are  At  apart.  We  also  know 
what  Ar  =  r (t  +  At)  —  r (t)  means — it  is  a  vector  that  points  from  the  head  of  r (?)  to  the  head  of  r (t  +  At), 
assuming  both  have  their  tails  at  the  origin.  So  when  At  is  small,  Ar  is  a  tiny  vector  pointing  from  one 
point  on  the  path  of  the  object  to  a  nearby  point.  As  At  gets  close  to  0,  this  vector  points  in  a  direction  that 
is  closer  and  closer  to  the  direction  in  which  the  object  is  moving;  geometrically,  it  approaches  a  vector 
tangent  to  the  path  of  the  object  at  a  particular  point. 
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Figure  15.3:  Approximating  the  derivative. 


Unfortunately,  the  vector  Ar  approaches  0  in  length;  the  vector  (0, 0, 0)  is  not  very  informative.  By 
dividing  by  At,  when  it  is  small,  we  effectively  keep  magnifying  the  length  of  Ar  so  that  in  the  limit  it 
doesn’t  disappear.  Thus  the  limiting  vector  (f'(t),g'(t),h'(t))  will  (usually)  be  a  good,  non-zero  vector 
that  is  tangent  to  the  curve. 

What  about  the  length  of  this  vector?  It’s  nice  that  we’ve  kept  it  away  from  zero,  but  what  does  it 
measure,  if  anything?  Consider  the  length  of  one  of  the  vectors  that  approaches  the  tangent  vector: 


r(t  +  At)  —  r  (t) 

r(t-(-A/)  -r(t)\ 

At 

At 

The  numerator  is  the  length  of  the  vector  that  points  from  one  position  of  the  object  to  a  “nearby”  position; 
this  length  is  approximately  the  distance  travelled  by  the  object  between  times  t  and  t  +  At.  Dividing  this 
distance  by  the  length  of  time  it  takes  to  travel  that  distance  gives  the  average  speed.  As  At  approaches 
zero,  this  average  speed  approaches  the  actual,  instantaneous  speed  of  the  object  at  time  t. 

By  performing  an  “obvious”  calculation  to  get  something  that  looks  like  the  derivative  of  r(t),  we  get 
precisely  what  we  would  want  from  such  a  derivative:  the  vector  r '(t)  points  in  the  direction  of  travel  of 
the  object  and  its  length  tells  us  the  speed  of  travel.  In  the  case  that  t  is  time,  then,  we  call  v(t)  =  r'(t)  the 
velocity  vector.  Even  if  t  is  not  time,  r\t)  is  useful — it  is  a  vector  tangent  to  the  curve. 


Solution.  We  compute  r'  =  (— sint,cost,  1),  and  |r/|  =  y/ sin2 1  +  cos2 1  +  1  =  y/2.  Thinking  of  this  as  a 
description  of  a  moving  object,  its  speed  is  always  \fl\  see  Figure  15.4.  X 
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Figure  15.4:  A  tangent  vector  on  the  helix. 


Example  15.4: 


The  velocity  vector  for  (cost  ,sint ,  cost)  is  (—  sin/, cos/,  -  sin/).  As  before,  the  first  two  coordinates 
mean  that  from  above  this  curve  looks  like  a  circle.  The  z  coordinate  is  now  also  periodic,  so  that 
as  the  object  moves  around  the  curve  its  height  oscillates  up  and  down.  In  fact  it  turns  out  that  the 
curve  is  a  tilted  ellipse,  as  shown  in  Figure  15.5. 


Figure  15.5:  The  ellipse  r  =  (cost,  sint, cost). 


Example  15.5: 


The  velocity  vector  for  (cost,  sint,  cos  2t)  is  (—  sint,cost,  —  2sin2t).  The  z  coordinate  is  now  oscil¬ 
lating  twice  as  fast  as  in  the  previous  example,  so  the  graph  is  not  surprising;  see  Figure  15.6. 
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Figure  15.6:  (cos /,  sin  f,  cos  2/). 


Example  15.6: 


Find  the  angle  between  the  curves  (t  +  l,t  —  2,t2 1 4}  and  (t  +  2,  —  t  +  3,/2  +  1)  where  they  meet. 


Solution.  The  angle  between  two  curves  at  a  point  is  the  angle  between  their  tangent  vectors — any  tangent 
vectors  will  do,  so  we  can  use  the  derivatives.  We  need  to  find  the  point  of  intersection,  evaluate  the  two 
derivatives  there,  and  finally  find  the  angle  between  them. 

To  find  the  point  of  intersection,  we  need  to  solve  the  equations 

t  T  1  —  u  +  2 
t  —  2  —  —u  +  3 
t2  —  4  =  u2  +  \ 

Solving  either  of  the  first  two  equations  for  u  and  substituting  in  the  third  gives  t  —  3.  This  together  with 
u  —  2  satisfies  all  three  equations.  Thus  the  two  curves  meet  at  (4, 1,5),  the  first  curve  when  t  —  3  and  the 
second  curve  when  t  —  2. 

The  derivatives  are  (1, 1,2 1)  and  (1,  — 1,2/);  at  the  intersection  point  these  are  (1, 1,6)  and  (1,  — 1,4). 
The  cosine  of  the  angle  between  them  is  then 

n  1-1+24  4 

cos  0  =  . —  . —  =  — =, 

-*/38\/l8  y/19 


so  0  —  arccos(4/\/l9)  p=i  0.41. 


X 


The  derivatives  of  vector  functions  obey  some  familiar  looking  rules,  which  we  will  occasionally  need. 
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Note  that  because  the  cross  product  is  not  commutative  you  must  remember  to  do  the  three  cross 
products  in  formula  (5.)  in  the  correct  order. 

When  the  derivative  of  a  function  f[t)  is  zero,  we  know  that  the  function  has  a  horizontal  tangent 
line,  and  may  have  a  local  maximum  or  minimum  point.  If  r \t)  =  0,  the  geometric  interpretation  is  quite 
different,  though  the  interpretation  in  terms  of  motion  is  similar.  Certainly  we  know  that  the  object  has 
speed  zero  at  such  a  point,  and  it  may  thus  be  abruptly  changing  direction.  In  three  dimensions  there  are 
many  ways  to  change  direction;  geometrically  this  often  means  the  curve  has  a  cusp  or  a  point,  as  in  the 
path  of  a  ball  that  bounces  off  the  floor  or  a  wall. 
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Figure  15.7:  (1  +  t3,t2, 1}  has  a  cusp  at  (1,0, 1). 


Sometimes  we  will  be  interested  in  the  direction  of  r'  but  not  its  length.  In  some  cases,  we  can  still 
work  with  r',  as  when  we  find  the  angle  between  two  curves.  On  other  occasions  it  will  be  useful  to  work 
with  a  unit  vector  in  the  same  direction  as  r';  of  course,  we  can  compute  such  a  vector  by  dividing  r'  by 
its  own  length.  This  standard  unit  tangent  vector  is  usually  denoted  by  T: 

_  r' 

t=m' 

In  a  sense,  when  we  computed  the  angle  between  two  tangent  vectors  we  have  already  made  use  of  the 
unit  tangent,  since 

/  /  , 

r 

IFi 


cos  6  — 


r'  -s' 


r  s' 


Now  that  we  know  how  to  make  sense  of  r',  we  immediately  know  what  an  antiderivative  must  be, 
namely 

j  r(t)dt  —  ( J  f(t)dt,  J g(t)dt,  j h(t)dt). 


if  r  =  (f(t),g(t),h(t)).  What  about  definite  integrals?  Suppose  that  v(t)  gives  the  velocity  of  an  object  at 
time  t.  Then  v(t)At  is  a  vector  that  approximates  the  displacement  of  the  object  over  the  time  At:  v(t)At 
points  in  the  direction  of  travel,  and  |v(f)Af|  =  |v(f)||A/|  is  the  speed  of  the  object  times  At,  which  is 
approximately  the  distance  travelled.  Thus,  if  we  sum  many  such  tiny  vectors: 


11—  1 

£  v(tf)Af 
/= o 

we  get  an  approximation  to  the  displacement  vector  over  the  time  interval  [to,  t„] .  If  we  take  the  limit  we 
get  the  exact  value  of  the  displacement  vector: 


11—  1 

lim  £  v(t,-)At 
i= 0 


r(t„)  —  r(t0). 


Thus,  given  the  velocity  vector  we  can  compute  the  vector  function  r  giving  the  location  of  the  object: 
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Example  15.9: 


An  object  moves  with  velocity  vector  (cost,  sint,  cost),  starting  at  (1,1,1).  Find  the  function  r 
giving  its  location. 


Solution. 


r(t)  =  (1, 1, 1)  +  /  (cos  a,  sin  n,  cos  w)  du 
Jo 

—  (1, 1, 1)  +  (sinw,  —  cosw,sinM)|o 
=  (1, 1, 1)  +  (sint,  — cost,sint)  —  (0,  — 1,0) 
=  (1  +  sint, 2  — cost,  1  +  sint) 


See  Figure  15.8. 


Figure  15.8:  Path  of  the  object  with  its  initial  velocity  vector. 


Exercises  for  15.2 


Exercise  15.2.1  Find  r'  and  Tforr  =  (t2, 1 ,  t). 

Exercise  15.2.2  Findr'  and  Tforr  =  (cost,sin2t,t2). 
Exercise  15.2.3  Findr'  and  Tforr  =  (cos(ef),sin(ef),sint). 


Exercise  15.2.4  Find  a  vector  function  for  the  line  tangent  to  the  helix  (cost,  sint,  t)  when  t  —  7t/4. 
Exercise  15.2.5  Find  a  vector  function  for  the  line  tangent  to  (cost,  sint,  cos  4t)  when  l  —  n/3. 
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Exercise  15.2.6  Find  the  cosine  of  the  angle  between  the  cun’es  (0 ,t2,t)  and  (cos(7Tt/2),sin(7rt/2),t) 
where  they  intersect. 

Exercise  15.2.7  Find  the  cosine  of  the  angle  between  the  cun’es  (cost,  — sin(t)/4,sint)  and  (cost,sint,sin(2t)) 
where  they  intersect. 

Exercise  15.2.8  Suppose  that  \r(t  j  =  k,  for  some  constant  k.  This  means  that  r  describes  some  path  on 
the  sphere  of  radius  k  with  center  at  the  origin.  Show  that  r  is  perpendicular  to  r'  at  every  point.  Hint: 

Use  Theorem  15.7,  part  (d). 

Exercise  15.2.9  A  bug  is  crawling  along  the  spoke  of  a  wheel  that  lies  along  a  radius  of  the  wheel.  The 
bug  is  crawling  at  1  unit  per  second  and  the  wheel  is  rotating  at  1  radian  per  second.  Suppose  the  wheel 
lies  in  the  v-z  plane  with  center  at  the  origin,  and  at  time  t  —  0  the  spoke  lies  along  the  positive  y  axis  and 
the  bug  is  at  the  origin.  Find  a  vector  function  r(t)  for  the  position  of  the  bug  at  time  t,  the  velocity  vector 
r'(t),  the  unit  tangent  T(t),  and  the  speed  of  the  bug  \r'(t)\. 

Exercise  15.2.10  An  object  moves  with  velocity  vector  (cost,  sint,t),  starting  at  (0,0,0)  when  t  —  0.  Find 
the  function  r  giving  its  location. 

Exercise  15.2.11  The  position  function  of  a  particle  is  given  byr(t)  —  (t2, 5t,  t2  —  16t),  t  >  0.  When  is  the 
speed  of  the  particle  a  minimum? 

Exercise  15.2.12  A  particle  moves  so  that  its  position  is  given  by  (cost,sint,cos(6t)).  Find  the  maximum 
and  minimum  speeds  of  the  particle. 

Exercise  15.2.13  An  object  moves  with  velocity  vector  (t,t2,cost),  starting  at  (0,0,0)  when  t  —  0.  Find 
the  function  r  giving  its  location. 

Exercise  15.2.14  What  is  the  physical  interpretation  of  the  dot  product  of  two  vector  valued  functions? 
What  is  the  physical  interpretation  of  the  cross  product  of  two  vector  valued  functions? 

Exercise  15.2.15  Show,  using  the  rules  of  cross  products  and  differentiation,  that 

jt(r(t)  x/(t))=r(t)  xr"(t). 


Exercise  15.2.16  Determine  the  point  at  which f(t)  —  (t,t2,t3)  and  g(t)  —  (cos(t),cos(2t),t+  1)  intersect, 
and  find  the  angle  between  the  curves  at  that  point.  (Hint:  You’ll  need  to  set  this  one  up  like  a  line 
intersection  problem,  writing  one  in  s  and  one  in  t.)  If  these  two  functions  were  the  trajectories  of  two 
airplanes  on  the  same  scale  of  time,  would  the  planes  collide  at  their  point  of  intersection?  Explain. 

Exercise  15.2.17  Find  the  equation  of  the  plane  perpendicular  to  the  curve  r(t)  =  (2sin(3t),t,2cos(3t)) 
at  the  point  (0, 7T,  —2). 

Exercise  15.2.18  Find  the  equation  of  the  plane  perpendicular  to  (cost,sint,cos(6t))  when  t  —  tr/4. 
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Exercise  15.2.19  At  what  point  on  the  curve  r(t)  —  (/3,3/,/4)  is  the  plane  perpendicular  to  the  curx>e  also 
parallel  to  the  plane  6x  +  6y  —  8z  =  1  ? 

Exercise  15.2.20  Find  the  equation  of  the  line  tangent  to  (cos/,sin/,cos(6/))  when  t  —  n/A. 


15.3  Arc  Length 


Sometimes  it  is  useful  to  compute  the  length  of  a  curve  in  space;  for  example,  if  the  curve  represents  the 
path  of  a  moving  object,  the  length  of  the  curve  between  two  points  may  be  the  distance  travelled  by  the 
object  between  two  times. 

Recall  that  if  the  curve  is  given  by  the  vector  function  r  then  the  vector  Ar  —  r(t  +  At)  —  r (t)  points 
from  one  position  on  the  curve  to  another,  as  depicted  in  Figure  15.3.  If  the  points  are  close  together, 
the  length  of  Ar  is  close  to  the  length  of  the  curve  between  the  two  points.  If  we  add  up  the  lengths  of 
many  such  tiny  vectors,  placed  head  to  tail  along  a  segment  of  the  curve,  we  get  an  approximation  to  the 
length  of  the  curve  over  that  segment.  In  the  limit,  as  usual,  this  sum  turns  into  an  integral  that  computes 
precisely  the  length  of  the  curve.  First,  note  that 

I  Ar  I 

I  Ar  I  = - -At  \r'(t)\At, 

At 

when  At  is  small.  Then  the  length  of  the  curve  between  r(a)  and  r (b)  is 


n—  1  n—  1  l^j.1  n—  1 

lim  V  I  Ar  I  =  lim  Y  - — -A  t—  lim  Y  |r'(/)|A/ 

n^oo  “  n— >oo  “  /\f  n— >•  oo  — v 

i= 0  i=0  i=() 


r  '(t)\dt. 


It  is  worth  stating  that  this  works  if  between  a  and  b  the  segment  of  curve  is  traced  out  exactly  once. 
Recall  from  Figure  15.1  that  the  function  r  =  (cos/, sin/,/)  is  a  helix. 


Solution.  We  compute  r'  =  (—  sin/,  cos/,  1)  and  |r' 


\J sin2/ +  cos2/ +  1  =  y/2,  so  the  length  is 


5\/2  K  s/l% 

tT 


X 


Example  15.11:  Length  of  a  Curve 


Suppose  y  —  \nx;  what  is  the  length  of  this  curve  between  x  —  1  and  x  —  y/3? 
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Solution.  Although  this  problem  does  not  appear  to  involve  vectors  or  three  dimensions,  we  can  interpret 
it  in  those  terms:  let  r(/)  =  (t,\nt,0).  This  vector  function  traces  out  precisely  y  =  lnx  in  the  x-y  plane. 

Then  r ■'(/)  —  (1,  l//,0)  and  |r,(t)|  =  \J \  +  l//2  and  the  desired  length  is 


1  H — ~  dt 


2  —  v/2  +  ln(v/2+  1)  —  ^  In  3. 


(This  integral  is  a  bit  tricky,  but  requires  only  methods  we  have  learned.) 


X 


Notice  that  there  is  nothing  special  about  y  —  lnx,  except  that  the  resulting  integral  can  be  computed. 
In  general,  given  any  y  —  /(x),  we  can  think  of  this  as  the  vector  function  r (/)  —  (/,/(/), 0).  Then  r '(/)  = 

(1, /'(/), 0)  and  |r'(/)|  =  \J  1  +  (/')2.  The  length  of  the  curve  y  —  f(x)  between  a  and  b  is  thus 

Ja  \J 1  +(f(x))2dx. 

Unfortunately,  such  integrals  are  often  impossible  to  do  exactly  and  must  be  approximated. 

One  useful  application  of  arc  length  is  the  arc  length  parameterization.  A  vector  function  r (/)  gives 
the  position  of  a  point  in  terms  of  the  parameter  /,  which  is  often  time,  but  need  not  be.  Suppose  s  is 
the  distance  along  the  curve  from  some  fixed  starting  point;  if  we  use  s  for  the  variable,  we  get  r(s),  the 
position  in  space  in  terms  of  distance  along  the  curve.  We  might  still  imagine  that  the  curve  represents  the 
position  of  a  moving  object;  now  we  get  the  position  of  the  object  as  a  function  of  how  far  the  object  has 
travelled. 


Example  15.12: 


Suppose  r(t )  =  (cos/,  sin /,()).  We  know  that  this  curve  is  a  circle  of  radius  1.  While  t  might 
represent  time,  it  can  also  in  this  case  represent  the  usual  angle  between  the  positive  x-axis  and  r(t ) . 
The  distance  along  the  circle  from  (1,0,0)  to  (cost,  sin/,0)  is  also  t — this  is  the  definition  of  radian 
measure.  Thus,  in  this  case  s  —  t  and  r(s )  =  (cos s,  sins,  0). 


Example  15.13: 


Suppose  r(t)  —  (cos/, sin/,/).  We  know  that  this  curve  is  a  helix.  Find  the  distance  along  the  helix 
from  (1,0,0)  to  (cos/,  sin/,/)  and  use  this  to  write  the  arc  length  parameterization  r(s )  of  r(t). 


Solution.  The  distance  along  the  helix  from  (1,0,0)  to  (cos/, sin/,/)  is 

s—  \r'(u)\du—  /  y/ cos2 u  +  sin2 u+  1  du  —  /  \fldu  —  \flt. 

Jo  Jo  Jo 

Thus,  the  value  of  /  that  gets  us  distance  s  along  the  helix  is  /  =  s/ \/2,  and  so  the  same  curve  is  given  by 
r(s)  =  (cos(s/v/2),sin(s/v/2),s/v/2).  X 

In  general,  if  we  have  a  vector  function  r(/),  to  convert  it  to  a  vector  function  in  terms  of  arc  length  we 
compute 
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solve  s  —  f(t )  for  t,  getting  t  =  g(s),  and  substitute  this  back  into  r (/)  to  get  r(s)  =  r(g(s)). 
Suppose  that  t  is  time.  By  the  Fundamental  Theorem  of  Calculus,  if  we  start  with  arc  length 


and  take  the  derivative,  we  get 


A0  =  |r'(0l- 


Here  s'(t)  is  the  rate  at  which  the  arc  length  is  changing,  and  we  have  seen  that  |r'(/)|  is  the  speed  of  a 
moving  object;  these  are  of  course  the  same. 

Suppose  that  r(s)  is  given  in  terms  of  arc  length;  what  is  |r'(.y)|?  It  is  the  rate  at  which  arc  length  is 
changing  relative  to  arc  length ;  it  must  be  1 !  In  the  case  of  the  helix,  for  example,  the  arc  length  param¬ 
eterization  is  (cos(x/v/2),sin(5/v/2),5/v/2),  the  derivative  is  (—  sm(s / \/2) / \/2,cos(s / \/2) / \/2, \/s/2), 
and  the  length  of  this  is 


/ 


sin2(x/v/2)  cos2(x/v/2)  1 

2  1  2  h2 


So  in  general,  r'  is  a  unit  tangent  vector. 

Exercises  for  15.3 

Exercise  15.3.1  Find  the  length  q/' (3  cos/,  2/,  3  sin/),  t  G  [0,2/r]. 

Exercise  15.3.2  Find  the  length  of(t2,2,t3),  t  G  [0, 1]. 

Exercise  15.3.3  Find  the  length  p/'(/2,sin/,cos/),  t  G  [0, 1]. 

Exercise  15.3.4  Find  the  length  of  the  curve  y  —  x3/2,  x  G  [1,91- 

Exercise  15.3.5  Set  up  an  integral  to  compute  the  length  of  (cos/,sin/,c'),  t  G  [0,5].  (It  is  tedious  but  not 
too  difficult  to  compute  this  integral .) 


15.4  Curvature 


Given  a  curve  r(/),  we  would  like  to  be  able  to  measure,  at  various  points,  how  sharply  curved  it  is.  Clearly 
this  is  related  to  how  “fast”  a  tangent  vector  is  changing  direction,  so  a  first  guess  might  be  that  we  can 
measure  curvature  with  |r"(/)|.  A  little  thought  shows  that  this  is  flawed;  if  we  think  of  t  as  time,  for 
example,  we  could  be  tracing  out  the  curve  more  or  less  quickly  as  time  passes.  The  second  derivative 
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|r"(t)|  incorporates  this  notion  of  time,  so  it  depends  not  simply  on  the  geometric  properties  of  the  curve 
but  on  how  quickly  we  move  along  the  curve. 


Example  15.14: 


Consider  r(t)  =  (cos/,  sin;, 0)  and  s(t)  —  (cos  2/,  sin  2/,  0).  Both  of  these  vector  functions  represent 
the  unit  circle  in  the  x-y  plane,  but  if  t  is  interpreted  as  time,  the  second  describes  an  object  moving 
twice  as  fast  as  the  first.  Computing  the  second  derivatives,  we  find  \r"{t)\  —  1,  |s"(f)|  =  4. 


To  remove  the  dependence  on  time,  we  use  the  arc  length  parameterization.  If  a  curve  is  given  by  r(.y), 
then  the  first  derivative  r/(.s')  is  a  unit  vector,  that  is,  r'f.v)  =  T(.v).  We  now  compute  the  second  derivative 
r"{s)  =  T^y)  and  use  |T/(^)  |  as  the  “official”  measure  of  curvature,  usually  denoted  K. 


Example  15.15: 


We  have  seen  that  the  arc  length  parameterization  of  a  particular  helix  is 

r(s)  —  (cos  {s/Vi)  ,sin  (s/  Vi),s/  Vi) 

Computing  the  second  derivative  gives  r"(s)  —  (—  008(5/  v/2)/2,  —  sin(  s/  Vi)  / 2, 0)  with  length  1/2. 


What  if  we  are  given  a  curve  as  a  vector  function  r (t),  where  t  is  not  arc  length?  We  have  seen  that  arc 
length  can  be  difficult  to  compute;  fortunately,  we  do  not  need  to  convert  to  the  arc  length  parameterization 
to  compute  curvature.  Instead,  let  us  imagine  that  we  have  done  this,  so  we  have  found  t  =  g(^)  and  then 
formed  r (5)  =  r(g(y)).  The  first  derivative  r'{s)  is  a  unit  tangent  vector,  so  it  is  the  same  as  the  unit  tangent 
vector  T(t)  =  T (,?(.v) ) .  Taking  the  derivative  of  this  we  get 


^TteW)  =  T'feWV(S)=T'(0^. 


The  curvature  is  the  length  of  this  vector: 


K  = 


|T'(<)I  |T'(Q| 

\ds/dt\  |r'(t)|  ’ 


(Recall  that  we  have  seen  that  ds/dt  —  |r'(t)|.)  Thus  we  can  compute  the  curvature  by  computing  only 
derivatives  with  respect  to  t;  we  do  not  need  to  do  the  conversion  to  arc  length. 


Example  15.16: 


Returning  to  the  helix,  suppose  we  start  with  the  parameterization  r(t)  —  (cos/, sin/,;).  Calculate 
the  curvature  K. 


Solution.  First  r \t)  —  (— sint,cost,  1),  |r'(t)|  =  Vl,  and  T(t)  =  (— sint,cost,  l)/\/2.  Then 

T'(t)  =  (—  cost,  —  sint,0)/Vi 
|T'(t)|  =  1/V2 


540  Vector  Functions 


Finally,  k  =  l/Vl/Vl  =  1  /2,  as  before. 


X 


Example  15.17: 


Consider  this  circle  of  radius  a:  r(t )  =  (a  cost,  asm  t,  1).  Find  the  curvature,  K. 


Solution. First  r'(f)  =  (—asint, acost, 0).  Then  |r'(t)|  =  a/ a2 sin2 t  +  a2 cos2 1 
Now 

T'(t)  =  (— acost,—  asint,0)/a 


and  |T'(t)|  =  1.  Finally, 


K—l/a 


a,  andT(t)  =  (— asint, acost, 0)/a. 


the  curvature  of  a  circle  is  everywhere  the  inverse  of  the  radius.  It  is  sometimes  useful  to  think  of  curvature 
as  describing  what  circle  a  curve  most  resembles  at  a  point.  The  curvature  of  the  helix  in  the  previous 
example  is  1/2;  this  means  that  a  small  piece  of  the  helix  looks  very  much  like  a  circle  of  radius  2,  as 
shown  in  Figure  15.9.  X 


Figure  15.9:  A  circle  with  the  same  curvature  as  the  helix. 


Example  15.18: 


Consider  r(t)  —  (cos/, sin/, cos 2/),  as  shown  in  Figure  15.6.  Compute  the  curvature,  K. 


Solution  First  r'(t)  =  (— sint,cost,  — 2sin(2t))  and  |r'(t)|  =  1  +  4shr(2t),  so 

.  ,  sint  cost  — 2sin2t 

T(0  = 


'l+4sin2(2r)  yl+4sin2(2t)  yl+4sin2(2t) 

Computing  the  derivative  of  this  and  then  the  length  of  the  resulting  vector  is  possible  but  unpleasant. 
Fortunately,  there  is  an  alternate  formula  for  the  curvature  that  is  often  simpler  than  the  one  we  have: 

|r'(t)  x  r"(f)| 


k  = 
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We  compute  the  second  derivative  r "(t)  —  (—  cost,  —  sint,  — 4cos(2t)).  Then  the  cross  product  r' (l)  x 
r "(t)  is 

(— ■ 4  cost  cos  2t  —  2  sin  t  sin  2t,  2  cost  sin  2t  —  4sintcos2t,  1). 

Computing  the  length  of  this  vector  and  dividing  by  |r'(t)  |3  is  still  a  bit  tedious.  With  the  aid  of  a  computer 

-\/48cos4t  —  48  cos2 1  +  17 


we  get 


K 


(— 16cos4t  +  16cos2t+  1  )3/2 


Graphing  this  we  get 


Compare  this  to  Figure  15.6.  The  highest  curvature  occurs  where  the  curve  has  its  highest  and  lowest 
points,  and  indeed  in  the  picture  these  appear  to  be  the  most  sharply  curved  portions  of  the  curve,  while 
the  curve  is  almost  a  straight  line  midway  between  those  points.  £ 

Let’s  see  why  this  alternate  formula  is  correct.  Starting  with  the  definition  of  T,  r/  =  |r/|T  so  by  the 
product  rule  r"  =  | r' | 'T  +  | r' | T7 .  Then  by  Theorem  12.7  the  cross  product  is 

r'xr"  =  |r'|Tx  |r'|'T  + |r'|T  x  |r'|T' 

=  |r'||r'|'(TxT)  +  |r'|2(TxT') 

=  |rf(TxT') 

because  T  x  T  =  0,  since  T  is  parallel  to  itself.  Then 

|r'xr"|  =  | r' | 2 1 T  x  T'| 

=  | r7 1 2 1 T 1 1 T' |  sin0 
=  |r'|2|T'| 

using  Exercise  15.2.8  in  Section  15.2  to  see  that  0  =  iz/2.  Dividing  both  sides  by  |r'|3  then  gives  the 
desired  formula. 

We  used  the  fact  here  that  Tr  is  perpendicular  to  T;  the  vector  N  =  T,/|T/|  is  thus  a  unit  vector  perpen¬ 
dicular  to  T,  called  the  unit  normal  to  the  curve.  Occasionally  of  use  is  the  unit  binormal  B  =  TxN,a 
unit  vector  perpendicular  to  both  T  and  N. 
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Exercises  for  15.3 


Exercise  15.4.1  Find  the  curvature  of  (t,t2,t). 
Exercise  15.4.2  Find  the  curvature  of  (/,/2,/2). 
Exercise  15.4.3  Find  the  curvature  of(t,t3,t5). 
Exercise  15.4.4  Find  the  curvature  ofy  =  x4  at  (1, 1). 

15.5  Acceleration  Vectors 


We  have  already  seen  that  if  t  is  time  and  an  object’s  location  is  given  by  r(/),  then  the  derivative  r'(t)  is 
the  velocity  vector  v(/).  Just  as  v(/)  is  a  vector  describing  how  r (/)  changes,  so  is  V(t)  a  vector  describing 
how  v(/)  changes,  namely,  a (/)  —  v'(/)  —  r "(/)  is  the  acceleration  vector. 


Example  15.19: 


Suppose  r(t )  =  (cos/, sin/,  1).  Then  F(t)  =  v(t)  =  (— sin/,  cos/,0)  and  V(t)  =  a(t)  — 
(—cos/,  — sin/,0).  This  describes  the  motion  of  an  object  traveling  on  a  circle  of  radius  1,  with 
constant  z  coordinate  1.  The  velocity  vector  is  of  course  tangent  to  the  curve;  note  that  a  -  v=  0,  so 
v  and  a  are  perpendicular.  In  fact,  it  is  not  hard  to  see  that  a  points  from  the  location  of  the  object 
to  the  center  of  the  circular  path  at  (0,0, 1). 


Recall  that  the  unit  tangent  vector  is  given  by  T  =  v/|v|,  so  v  =  |v|T.  If  we  take  the  derivative  of  both 
sides  of  this  equation  we  get 


a  =  |v|'T  +  |v|T'.  (15.1) 

Also  recall  the  definition  of  the  curvature,  K  =  |  T7 1  / 1  v | ,  or  |T7 1  =  xjv|.  Finally,  recall  that  we  defined 
the  unit  normal  vector  as  N  =  T,/|T/|,  so  T;  =  |T7 |N  =  xjv|N.  Substituting  into  Equation  15.1  we  get 

a=  |v|'T+/c|v|2N.  (15.2) 

The  quantity  |v(/)|  is  the  speed  of  the  object,  often  written  as  v(/);  v(/)|'  is  the  rate  at  which  the  speed 
is  changing,  or  the  scalar  acceleration  of  the  object,  a(t).  Rewriting  Equation  15.2  with  these  gives  us 

a  =  aT  +  /cv2N  =  aT  T  +  a^  N; 

aj  is  the  tangential  component  of  acceleration  and  un  is  the  normal  component  of  acceleration.  We 

have  already  seen  that  a-/  measures  how  the  speed  is  changing;  if  you  are  riding  in  a  vehicle  with  large 
aj  you  will  feel  a  force  pulling  you  into  your  seat.  The  other  component,  ajv,  measures  how  sharply  your 
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direction  is  changing  with  respect  to  time.  So  it  naturally  is  related  to  how  sharply  the  path  is  curved, 
measured  by  K,  and  also  to  how  fast  you  are  going.  Because  a,v  includes  v2,  note  that  the  effect  of  speed 
is  magnified;  doubling  your  speed  around  a  curve  quadruples  the  value  of  c/,y .  You  feel  the  effect  of  this  as 
a  force  pushing  you  toward  the  outside  of  the  curve,  the  “centrifugal  force.” 

In  practice,  if  want  a #  we  would  use  the  formula  for  k: 


.12  lr'  X  r"l  |„/|2 

r'  x  r" 

v  x  a 

1  IO  - 

|r'| 

V 

To  compute  aj  we  can  project  a  onto  v: 


v  a 

aT  —  1 — p 

V 


„/  _// 

r  ■  r 


Solution.  Taking  derivatives  we  get  v  =  (l,2t,3t2)  and  a  =  (0,2,6 1).  Then 

4f+18f3  ,  \/4  +  36f 2  +  36t4 

cir  =  .  and  %  =  — . 

\/l  +  4t2  +  9t4  Vl+4t2  +  9t4 

* 


Exercises  for  15.5 


Exercise  15.5.1  Let  r  =  (t5,2t2,t).  Compute  v,  a,  aj,  and  on. 

Exercise  15.5.2  Letr  —  (cost,sint,t2).  Compute  v,  a,  aj,  and  a^. 

Exercise  15.5.3  Let  r  —  (cost,sint,ef).  Compute  v,  a,  a j,  and  ap. 

Exercise  15.5.4  Letr  —  (ef,sin  t,ef).  Compute  v,  a,  aj,  and  a,\i. 

Exercise  15.5.5  Suppose  an  object  moves  so  that  its  acceleration  is  given  by  a  —  (—3 cos t,  —2 sin/, 0).  At 
time  t  —  0  the  object  is  at  (3,0,0)  and  its  velocity  vector  is  (0,2,0).  Find  v(t)  and  r(t)  for  the  object. 

Exercise  15.5.6  Suppose  an  object  moves  so  that  its  acceleration  is  given  by  a  —  (—3 cos t,  — 2sin/,0).  At 
time  t  —  0  the  object  is  at  (3,0,0)  and  its  velocity  vector  is  (0,2. 1,0).  Find  v(t  )  and  r{t)  for  the  object. 

Exercise  15.5.7  Suppose  an  object  moves  so  that  its  acceleration  is  given  by  a  —  (—3 cos t,  —2 sin/, 0).  At 
time  t  —  0  the  object  is  at  (3,0,0)  and  its  velocity  vector  is  (0,2, 1).  Find  v{t)  and  r{t)  for  the  object. 
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Exercise  15.5.8  Suppose  an  object  moves  so  that  its  acceleration  is  given  by  a  —  (—3 cos t,  — 2sin/,0).  At 
time  t  —  0  the  object  is  at  (3,0,0)  and  its  velocity  vector  is  (0,2.1, 1).  Find  v(t)  and  r(t)for  the  object. 

Exercise  15.5.9  Describe  a  situation  in  which  the  normal  component  of  acceleration  is  0  and  the  tan¬ 
gential  component  of  acceleration  is  non-zero.  Is  it  possible  for  the  tangential  component  of  acceleration 
to  be  0  while  the  normal  component  of  acceleration  is  non-zero?  Explain.  Finally,  is  it  possible  for  an 
object  to  move  ( not  be  stationary)  so  that  both  the  tangential  and  normal  components  of  acceleration  are 
0?  Explain. 


16.  Vector  Calculus 


16.1  Vector  Fields 


This  chapter  is  concerned  with  applying  calculus  in  the  context  of  vector  fields.  A  two-dimensional  vector 
field  is  a  function  /  that  maps  each  point  (x,y)  in  M2  to  a  two-dimensional  vector  (w,v),  and  similarly 
a  three-dimensional  vector  field  maps  (x,y,z)  to  ( u,v,w ).  Since  a  vector  has  no  position,  we  typically 
indicate  a  vector  field  in  graphical  form  by  placing  the  vector  f(x,y)  with  its  tail  at  (x,y).  Figure  16.1 
shows  a  representation  of  the  vector  field  f(x,y )  =  (x/  \J x1  +y2  +  4,  —yj  \J x2  +  y2  +  4) .  For  such  a  graph 
to  be  readable,  the  vectors  must  be  fairly  short,  which  is  accomplished  by  using  a  different  scale  for  the 
vectors  than  for  the  axes.  Such  graphs  are  thus  useful  for  understanding  the  sizes  of  the  vectors  relative  to 
each  other  but  not  their  absolute  size. 
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Figure  16.1:  A  vector  field. 


Vector  fields  have  many  important  applications,  as  they  can  be  used  to  represent  many  physical  quan¬ 
tities:  the  vector  at  a  point  may  represent  the  strength  of  some  force  (gravity,  electricity,  magnetism)  or  a 
velocity  (wind  speed  or  the  velocity  of  some  other  fluid). 

We  have  already  seen  a  particularly  important  kind  of  vector  field — the  gradient.  Given  a  function 
f(x,y),  recall  that  the  gradient  is  f  =  V/  =  (fx(x,y),fy(x,y)),  a  vector  that  depends  on  (is  a  function  of)  x 
and  y.  We  usually  picture  the  gradient  vector  with  its  tail  at  (x,y),  pointing  in  the  direction  of  maximum 
increase.  Vector  fields  that  are  gradients  have  some  particularly  nice  properties,  as  we  will  see.  An 
important  example  is 

f  =/ _ zf _ ZZ _ zi _ \ 

\  (x2  +y2  +Z2)3/2  ’  (x2  +y2  +z2)3/2’  (jc2  +/  +  .2)3/2  /  ’ 


545 


546  Vector  Calculus 


which  points  from  the  point  (x,y,z)  toward  the  origin  and  has  length 


a/x2  +  y2  +  z2  _ 1 _ 

(x2  +y2  +z2)3/2  (V-^+y^T?)2’ 

which  is  the  reciprocal  of  the  square  of  the  distance  from  (x,y,z)  to  the  origin — in  other  words,  f  is  an 
“inverse  square  law”.  The  vector  f  is  a  gradient: 


f  = 


V 


1 

a/ x2  +y2  +  z2 


which  turns  out  to  be  extremely  useful. 


(16.1) 


Exercises  for  16.1 


Exercise  16.1.1 
Exercise  16.1.2 
Exercise  16.1.3 
Exercise  16.1.4 
Exercise  16.1.5 
Exercise  16.1.6 


Investigate  the  vector  field  (x,y) 
Investigate  the  vector  field  (—  x,  —  y) 
Investigate  the  vector  field  (x,  —y) 
Investigate  the  vector  field  (sinx,cosy) 
Investigate  the  vector  field  (x+  l,x  +  3) 
Verify  Equation  16.1. 


16.2  Divergence  and  Curl 


Divergence  and  curl  are  two  measurements  of  vector  fields  that  are  very  useful  in  a  variety  of  applications. 
Both  are  most  easily  understood  by  thinking  of  the  vector  field  as  representing  a  flow  of  a  liquid  or  gas;  that 
is,  each  vector  in  the  vector  field  should  be  interpreted  as  a  velocity  vector.  Roughly  speaking,  divergence 
measures  the  tendency  of  the  fluid  to  collect  or  disperse  at  a  point,  and  curl  measures  the  tendency  of  the 
fluid  to  swirl  around  the  point.  Divergence  is  a  scalar,  that  is,  a  single  number,  while  curl  is  itself  a  vector. 
The  magnitude  of  the  curl  measures  how  much  the  fluid  is  swirling,  the  direction  indicates  the  axis  around 
which  it  tends  to  swirl.  These  ideas  are  somewhat  subtle  in  practice,  and  are  beyond  the  scope  of  this 
course. 

Recall  that  if  /  is  a  function,  the  gradient  of  /  is  given  by 


V/ 


d£  df  df\ 
dx  ’  dy  ’  dz  / 


16.2.  Divergence  and  Curl  547 


A  useful  mnemonic  for  this  (and  for  the  divergence  and  curl,  as  it  turns  out)  is  to  let 


V 


d_  d_  d_ 

dx'  dy’  dz 


that  is,  we  pretend  that  V  is  a  vector  with  rather  odd  looking  entries.  Recalling  that  (u,  v,  w)a—  ( ua ,  va,  wa) , 
we  can  then  think  of  the  gradient  as 


V/ 


dxdy’dz  3 


dfdfdf\ 

dx’dy’dz/’ 


that  is,  we  simply  multiply  the  /  into  the  vector. 

The  divergence  and  curl  can  now  be  defined  in  terms  of  this  same  vector  V  by  using  the  cross  product 
and  dot  product. 


Here  are  two  simple  but  useful  facts  about  divergence  and  curl. 


In  words,  this  says  that  the  divergence  of  the  curl  is  zero. 


That  is,  the  curl  of  a  gradient  is  the  zero  vector.  Recalling  that  gradients  are  conservative  vector  fields, 
this  says  that  the  curl  of  a  conservative  vector  field  is  the  zero  vector.  Under  suitable  conditions,  it  is  also 
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true  that  if  the  curl  of  f  is  0  then  f  is  conservative.  (Note  that  this  is  exactly  the  same  test  that  we  discussed 
at  the  end  of  Section  16.3.3.) 


Example  16.3: 


Let  f  =  (2ex  —  y,  —x,  ez) .  Show  that  f  is  conservative  and  find  the  function  f  such  that  f  =  (fx,  fy,fz) . 


Solution.  If  f  =  (2eY  —  y,  —  x,ez)  then  V  x  f  =  (0,0,  (  —  1)  —  (  —  1))  =  0.  Thus,  f  is  conservative,  and  we  can 
exhibit  this  directly  by  finding  the  corresponding  /. 

Since  fx  =  2 ex  -y,  f  =  2 ex  -xy  +  g(y,z).  Since  fy  =  -x,  it  must  be  that  gy  =  0,  so  g(y,z)  =  C  +  h(z). 
Thus  f  —  2ex  —  xy  +  C  +  h(z )  and 

fz  =  h'(z)  = 

so  h(z)  —  ez.  This  leaves  /  =  2ex  —  xy  +  ez  +  C.  X 


Exercises  for  16.2 


Exercise  16.2.1  Let  f  —  (xy,  —  xy)  and  let  D  be  given  by  0  <  x  <  1,  0  <  y  <  1.  Compute  /  /■  dr  and 

JdD 


IdD 


f-Nds. 


Exercise  16.2.2  Let  f  —  (ax2, by1)  and  let  D  be  given  by  0  <  x  <  I,  0  <  y  <  1.  Compute  /  /•  dr  and 

JdD 


IdD 


f-Nds. 


Exercise  16.2.3  Let  f  —  (ay2,bx2)  and  let  D  be  given  by  0  <  x  <  1,  0  <  y  <  x.  Compute  /  /•  dr  and 

JdD 


IdD 


f-Nds. 


Exercise  16.2.4  Let  f  —  (sinxcosy,cosxsiny)  and  let  D  be  given  by  0  <  x  <  n /2,  0  <  y  <  x.  Compute 
/  f- dr  and  /  f-Nds. 

JdD  JdD 

Exercise  16.2.5  Let  f=  (y,—x)  and  let  D  be  given  by  x2 +y2  <  l.  Compute  /  f- dr  and  /  f-Nds. 

J  dD  J dD 

Exercise  16.2.6  Let  f  =  (x,y)  and  let  D  be  given  by  x2  +v2  <  1.  Compute  /  f- dr  and  /  f-Nds. 

JdD  JdD 


Exercise  16.2.7  Prove  Theorem  16.1. 

Exercise  16.2.8  Prove  Theorem  16.2. 

Exercise  16.2.9  If  V  ■/  =  0,  /  is  said  to  be  incompressible.  Show  that  any  vector  field  of  the  form 
f(x,y,z )  =  (f(y, z),g (x, z),h (x,  y ) )  is  incompressible.  Give  a  non-trivial  example. 
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16.3  Line  Integrals 


We  have  so  far  integrated  “over”  intervals,  areas,  and  volumes  with  single,  double,  and  triple  integrals.  We 
now  investigate  integration  over  or  “along”  a  curve — “line  integrals”  are  really  “curve  integrals”. 

16.3.1.  Line  Integrals  of  a  Function 


As  with  other  integrals,  a  geometric  example  may  be  easiest  to  understand.  Consider  the  function  /  =  x+y 
and  the  parabola  y  —  x2  in  the  x-y  plane,  for  0  <  x  <  2.  Imagine  that  we  extend  the  parabola  up  to  the 
surface  /,  to  form  a  curved  wall  or  curtain,  as  in  Figure  16.2.  What  is  the  area  of  the  surface  thus  formed? 
We  already  know  one  way  to  compute  surface  area,  but  here  we  take  a  different  approach  that  is  more 
useful  for  the  problems  to  come. 


Figure  16.2:  Approximating  the  area  under  a  curve. 

As  usual,  we  start  by  thinking  about  how  to  approximate  the  area.  We  pick  some  points  along  the  part 
of  the  parabola  we’re  interested  in,  and  connect  adjacent  points  by  straight  lines;  when  the  points  are  close 
together,  the  length  of  each  line  segment  will  be  close  to  the  length  along  the  parabola.  Using  each  line 
segment  as  the  base  of  a  rectangle,  we  choose  the  height  to  be  the  height  of  the  surface  /  above  the  line 
segment.  If  we  add  up  the  areas  of  these  rectangles,  we  get  an  approximation  to  the  desired  area,  and  in 
the  limit  this  sum  turns  into  an  integral. 

Typically  the  curve  is  in  vector  form,  or  can  easily  be  put  in  vector  form.  Then  as  we  have  seen  in 
Section  15.3,  the  length  of  one  of  the  straight  line  segments  in  the  approximation  is  close  to  ds  —  |v'|  dt. 
In  this  example  we  have  y(t)  —  (t,t2),  and  ds  —  |v'|  dt  —  \J 1  +  At2dt.  Therefore  the  integral  is 

[  f(t,t2)Vl+4t2dt  =  [  (t  +  t2)\/l  +4t2dt  —  -jx-Vll  —  -pr  —  ^-ln(4  +  \/l7). 

Jo  Jo  48  12  64 

This  integral  of  a  function  along  a  curve  C  is  often  written  in  abbreviated  form  as 
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Example  16.5:  Compute  Line  Segment 

Compute  j 

f  yex  ds  where  C  is  the  line  segment  from  (1,2)  to  (4,7). 
c 

Solution.  We  write  the  line  segment  as  a  vector  function:  v  =  (1,2)  +t{3,5),  0  <  t  <  1,  or  in  parametric 
form  x  —  1  +  3t,  y  =  2  +  5t.  Then 


(2  +  5t)e1+3V32  +  52J/  =  ^-V34e4-^V34e. 


X 


All  of  these  ideas  extend  to  three  dimensions  in  the  obvious  way. 


Example  16.6:  Compute  Line  Segment 

Compute  j 

f  x2zds  where  C  is  the  line  segment  from  (0,6,  —  1)  to  (4, 1,5). 
c 

Solution.  We  write  the  line  segment  as  a  vector  function:  v  =  (0,6, —  1)  +  t(4,  —  5,6),  0  <  t  <  1,  or  in 
parametric  form  x  —  4t,  y  =  6  —  5t,  z  —  —1  +  6 1.  Then 


[  x2zds  —  /1(4t)2(— 1  +  6t)\/16  +  25  +  36Jt  =  16\/77  ['  -t2  +  6t3 dt  - 
Jc  Jo  Jo  3 


X 


16.3.2.  Line  Integrals  of  a  Vector  Field 


Now  we  turn  to  a  perhaps  more  interesting  example.  Recall  that  in  the  simplest  case,  the  work  done  by  a 
force  on  an  object  is  equal  to  the  magnitude  of  the  force  times  the  distance  the  object  moves;  this  assumes 
that  the  force  is  constant  and  in  the  direction  of  motion.  We  have  already  dealt  with  examples  in  which  the 
force  is  not  constant;  now  we  are  prepared  to  examine  what  happens  when  the  force  is  not  parallel  to  the 
direction  of  motion. 


We  have  already  examined  the  idea  of  components  of  force,  in  Example  12.5:  the  component  of  a 
force  F  in  the  direction  of  a  vector  v  is 


F  v 


v, 
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the  projection  of  F  onto  v.  The  length  of  this  vector,  that  is,  the  magnitude  of  the  force  in  the  direction  of 
v,  is 

F  v 

I  I  ’ 

|v| 

the  scalar  projection  of  F  onto  v.  If  an  object  moves  subject  to  this  (constant)  force,  in  the  direction  of  v, 
over  a  distance  equal  to  the  length  of  v,  the  work  done  is 


F  v 

Ivl 


V 


F  y. 


Thus,  work  in  the  vector  setting  is  still  “force  times  distance”,  except  that  “times”  means  “dot  product”. 

If  the  force  varies  from  point  to  point,  it  is  represented  by  a  vector  field  F  (note  we  use  a  capital  F  here 
to  denote  force),  the  displacement  vector  v  may  also  change,  as  an  object  may  follow  a  curving  path  in 
two  or  three  dimensions.  Suppose  that  the  path  of  an  object  is  given  by  a  vector  function  r(t);  at  any  point 
along  the  path,  the  (small)  tangent  vector  r'  At  gives  an  approximation  to  its  motion  over  a  short  time  At, 
so  the  work  done  during  that  time  is  approximately  F  •  r'  At;  the  total  work  over  some  time  period  is  then 


F  r  'dt. 


It  is  useful  to  rewrite  this  in  various  ways  at  different  times.  We  start  with 


f1  F-r' dt  =  [  F  dr, 

Jt0  JC 

abbreviating  r'  dt  by  Jr.  Or  we  can  write 

/%-r 'dt=  riF~|r'| dt  =  F •  T |r'| dt  =  f  FT ds, 

JtQ  JtQ  |r|  Jt0  Jc 

using  the  unit  tangent  vector  T,  abbreviating  |r/|  Jt  as  ds,  and  indicating  the  path  of  the  object  by  C.  In 
other  words,  work  is  computed  using  a  particular  line  integral  of  the  form  we  have  considered.  Alternately, 
we  sometimes  write 

IcF  r'd‘  =  Sc{h’h’h)  •  W’y'-Z')d'  =  Jc  {f'Tt+flJ,  +/3§)  * 

=  /  /i dx  +  f2dy  +  fs dz  —  /  fidx+  /  fidy+  /  fodz, 

Jc  Jc  Jc  Jc 

and  similarly  for  two  dimensions,  leaving  out  references  to  z. 

We  define  this  formally. 


— 

Definition  16.7:  Line  Integral  of  a  Vector  Field 

Let  F  be  a  vector  field  and  C  a  <. 
given  by 

J 

:urve  with  parametrization  r(t).  Then  the  integral  of  F  over  C  is 

(  F  ■  dr  =  (  F-  Tds  —  pF-i'dt 

C  Jc  JtQ 
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Solution.  We  can  write  the  force  in  terms  of  t  as  (tsin(t2),t2),  and  compute  r '(t)  —  (1,2 1),  and  then  the 
work  is 


(tsin(r),r)  •  (1,2 t)dt 


tsin(r)  +  2 t3  dt 


cos(l)  —  cos(4) 
2 


+ 


15 

T ' 


Alternately,  we  might  write 


J  xsin ydx+  J  ydy 


rl  .  .  2\  ,  f4  .  cos(4)  cos(l)  16  1 

xsm (ir)dx+  J  ydy  = - - 2+Y~2 


getting  the  same  answer. 


X 


Exercises  for  16.3.2 


Exercise  16.3.1  Compute  /  xy2  ds  along  the  line  segment  from  (1,2,0)  to  (2, 1,3). 

Jc 

Exercise  16.3.2  Compute  /  sinxr/5  along  the  line  segment  from  (  —  1,2, 1)  to  (1,2,5). 

Jc 

Exercise  16.3.3  Compute  j  zcos(xy)  ds  along  the  line  segment  from  (1,0, 1)  to  (2,2,3). 

Exercise  16.3.4  Compute  /  xey  dx-\-  x2y dy  along  the  line  segment  y  —  3,  0  <  x  <  2. 

Jc 

Exercise  16.3.5  Compute  /  xey  dx  +  x2y dy  along  the  line  segment  x  —  4,  0  <  y  <  4. 

Jc 

Exercise  16.3.6  Compute  j  xey  dx  +  x2y  dy  along  the  curx’e  x  —  3t,  y  —  t2,  0  <  t  <  1. 

Exercise  16.3.7  Compute  /  xey  dx  +  x2y dy  along  the  cur\>e  (e1  ,e‘),  —  1  <  t  <  1. 

Jc 

Exercise  16.3.8  Compute  j  (cosx,siny)  •  dr  along  the  cur\’e  (t,t),  0  <  t  <  1. 

Exercise  16.3.9  Compute  /  (1/xy,  1  /  (x+y))  ■  dr  along  the  path  from  ( 1, 1)  to  (3, 1)  to  (3, 6)  using  straight 

Jc 

line  segments. 
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Exercise  16.3.10  Compute  j  (1  /xy,  l/(x  +  y))  ■  dr  along  the  curve  (2t,5t),  1  <  t  <  4. 

Exercise  16.3.11  Compute  j  (1  /xy,  l/(x  +  y))  ■  dr  along  the  curve  (t,t2),  1  <  t  <  4. 

Exercise  16.3.12  Compute  j  yzdx  +  xzdy  +  xy dz  along  the  curve  (t,t2,t3),  0  <  t  <  1. 

Exercise  16.3.13  Compute  /  yzdx  +  xzdy +  xydz  along  the  cun’e  (cost,  sin/,  tant),  0  <  t  <  n. 

Jc 

Exercise  16.3.14  An  object  moves  from  (1, 1)  to  (4,8)  along  the  path  r(t )  =  (/2,/3),  subject  to  the  force 
F  —  (x2,  sin};).  Find  the  work  done. 


Exercise  16.3.15  An  object  moves  along  the  line  segment  from  (1, 1)  to  (2,5),  subject  to  the  force  F  = 
(x/ ( x 2  +y2),y/ {x2  +  y2)).  Find  the  work  done. 

Exercise  16.3.16  An  object  moves  along  the  parabola  /*(/)  =  (t,t2),  0  <  t  <  1,  subject  to  the  force  F  — 
(l/(y  +  1),  — l/(x+  1)).  Find  the  work  done. 


Exercise  16.3.17  An  object  moves  along  the  line  segment  from  (0,0,0)  to  (3,6,10),  subject  to  the  force 
F  —  (x2,y2,z2).  Find  the  work  done. 

Exercise  16.3.18  An  object  moves  along  the  cur\’e  r(t)  —  (\/t, \/\/t,t)  1  <  t  <  4,  subject  to  the  force 
F  —  ( y,z,x ).  Find  the  work  done. 

Exercise  16.3.19  An  object  moves  from  (1, 1, 1)  to  (2,4,8)  along  the  path  r(t)  —  (. t,t2,t 3),  subject  to  the 
force  F  —  (sin.r,  sin  v,  sinz).  Find  the  work  done. 


Exercise  16.3.20  An  object  moves  from  (1,0,0)  to  ( —  1 , 0, 7T)  along  the  path  r[t )  =  (cos/,  sin?,/),  subject 
to  the  force  F  —  (y2,y2,xz).  Find  the  work  done. 


Exercise  16.3.21  Give  an  example  of  a  non-trivial  force  field  F  and  non-trivial  path  r{t)  for  which  the 
total  work  done  moving  along  the  path  is  zero. 


16.3.3.  The  Fundamental  Theorem  of  Line  Integrals 


One  way  to  write  the  Fundamental  Theorem  of  Calculus  (Theorem  6.13)  is: 

/  f{x)dx  =  f(b)-f(a). 

J  a 

That  is,  to  compute  the  integral  of  a  derivative  f  we  need  only  compute  the  values  of  /  at  the  endpoints. 
Something  similar  is  true  for  line  integrals  of  a  certain  form. 
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Proof.  We  write  r  =  (x(t),y(t),z(t)),  so  that  r'  =  (x '(t),y/{t),z!(t)).  Also,  we  know  that  V/  =  (fx,fy,fz). 

Then 

fcyf'dr  =  J  (fxJyJz)  ■  (d(t),y'(t),z!{t))dt  =  J  fxx'  +  fyy'  +  fzzdt. 

By  the  chain  rule  (see  Section  13.4)  f^+fyy'+fvz'  =  df  /dt,  where  /  in  this  context  means  f(x(t),y(t),z(t)), 
a  function  of  t.  In  other  words,  all  we  have  is 

[  f'{t)dt  = 

J  a 

In  this  context,  f(a)  =  f(x(a),y(a),z(a)).  Since  a  =  r(a)  =  ( x(a),y(a),z(a )),  we  can  write  f(a)  — 
/(a) — this  is  a  bit  of  a  cheat,  since  we  are  simultaneously  using  /  to  mean  f(t )  and  f(x,y,z),  and  since 
f(x(a),y(a),z(a))  is  not  technically  the  same  as  f((x(a),y(a),z(a))),  but  the  concepts  are  clear  and  the 
different  uses  are  compatible.  Doing  the  same  for  b,  we  get 

JcVf-dr  =  faf'{t)dt  =  f(b)  —  f(a)  =  /(b)  -  /(a). 


X 

This  theorem,  like  the  Fundamental  Theorem  of  Calculus,  says  roughly  that  if  we  integrate  a  “derivative¬ 
like  function”  (/'  or  V/)  the  result  depends  only  on  the  values  of  the  original  function  (/)  at  the  endpoints. 

If  a  vector  field  f  is  the  gradient  of  a  function,  f  =  V/,  we  say  that  f  is  a  conservative  vector  field.  If  F 
is  a  conservative  force  field,  then  the  integral  for  work,  fc  F  •  Jr,  is  in  the  form  required  by  the  Fundamental 
Theorem  of  Line  Integrals.  This  means  that  in  a  conservative  force  field,  the  amount  of  work  required  to 
move  an  object  from  point  a  to  point  b  depends  only  on  those  points,  not  on  the  path  taken  between  them. 


Solution.  The  straightforward  way  to  do  this  involves  substituting  the  components  of  r  into  F,  forming 
the  dot  product  F  •  r',  and  then  trying  to  compute  the  integral,  but  this  integral  is  extraordinarily  messy, 
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perhaps  impossible  to  compute.  From  Equation  16.1  we  know  that  F 
only  substitute: 


[  F  •  dr  =  1 

'C  \Jx2  +y2  +  z2 


(2, b-l) 


(1,0,0) 


1 

y/6 


V ( 1  / \Jx2  +y2  +  z2)  so  we  need 


-  1. 


* 


Another  immediate  consequence  of  the  Fundamental  Theorem  involves  closed  paths.  A  path  C  is 
closed  if  it  forms  a  loop,  so  that  travelling  over  the  C  curve  brings  you  back  to  the  starting  point.  If  C  is  a 
closed  path,  we  can  integrate  around  it  starting  at  any  point  a;  since  the  starting  and  ending  points  are  the 
same, 

jfv/-dr  =  /(a)~/(a)  =  0. 

For  example,  in  a  gravitational  field  (an  inverse  square  law  field)  the  amount  of  work  required  to  move  an 
object  around  a  closed  path  is  zero.  Of  course,  it’s  only  the  net  amount  of  work  that  is  zero.  It  may  well 
take  a  great  deal  of  work  to  get  from  point  a  to  point  b,  but  then  the  return  trip  will  “produce”  work.  For 
example,  it  takes  work  to  pump  water  from  a  lower  to  a  higher  elevation,  but  if  you  then  let  gravity  pull 
the  water  back  down,  you  can  recover  work  by  running  a  water  wheel  or  generator.  (In  the  real  world  you 
won’t  recover  all  the  work  because  of  various  losses  along  the  way.) 

To  make  use  of  the  Fundamental  Theorem  of  Line  Integrals,  we  need  to  be  able  to  spot  conservative 
vector  fields  f  and  to  compute  /  so  that  f  =  V/.  Suppose  that  f  =  (/i,/2)  =  V/.  Then  f\  —  fx  and 
/2  =  fy,  and  provided  that  /  is  sufficiently  nice,  we  know  from  Clairaut’s  Theorem  (13.23)  that  j^/i  = 

fxy  =  fy x  —  Jy./A  If  we  compute  ^-  f\  and  Jf/2  and  find  that  they  are  not  equal,  then  f  is  not  conservative. 

If  Jy/j  =  Jy/2,  then,  again  provided  that  f  is  sufficiently  nice,  we  can  be  assured  that  f  is  conservative. 
Ultimately,  what’s  important  is  that  we  be  able  to  find  /;  as  this  amounts  to  finding  anti-derivatives,  we 
may  not  always  succeed. 


Solution.  First,  note  that 

4-  (4x3  -  v2)  =  -2 y  and  -4(4-  2xy)  =  -2 y, 

ay  dx 

so  the  desired  /  does  exist.  This  means  that  fx  —  4x3  —  y2,  so  that  f  —  xA  —xy1  +g(y);  the  first  two  terms 
are  needed  to  get  4x3  —  y2,  and  the  g(y)  could  be  any  function  of  y,  as  it  would  disappear  upon  taking  a 
derivative  with  respect  to  x.  Likewise,  since  fy  —  4  —  2xy,  /  =  4y  —  xy2  +  h  (x) .  The  question  now  becomes, 
is  it  possible  to  find  g(y)  and  h  (x)  so  that 

x4  -  .xy2  +  g(y)  =4 y-xv2  +  h(x) , 

and  of  course  the  answer  is  yes:  g(y)  =  4 y,  h(x)  =  x4.  Thus,  /  =  4y  +  x4  —  xy2.  X 

We  can  test  a  vector  field  f  =  (f\,f2,h)  in  a  similar  way.  Suppose  that  =  {. fx,fy,fz )■  If 

we  temporarily  hold  z  constant,  then  f(x,y,z )  is  a  function  of  x  and  y,  and  by  Clairaut’s  Theorem  ^-  f  \  — 
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fxy  =  fyx  =  jfx-fi-  Likewise,  holding  y  constant  implies  jj-J\  =  fxz  =  fvc=  j~xh,  and  with  x  constant  we 
get  =  fyz  =fzy  =  Conversely,  if  we  find  that  j-yf\  =  j-J2,  j-J]  =  j-Ji,  and  j-J2  =  ^/3  then 


dz^  ^  Jzy  <9  y 

f  is  conservative. 


Exercises  for  16.3.3 


Exercise  16.3.22  Find  an  f  so  that  V/  =  (2x  +  y2,2y  +  x2),  or  explain  why  there  is  no  such  f. 
Exercise  16.3.23  Find  an  f  so  that  V/  =  (x3,  —  y4),  or  explain  why  there  is  no  such  f. 
Exercise  16.3.24  Find  an  f  so  that  V/  =  (xey,yex),  or  explain  why  there  is  no  such  f. 
Exercise  16.3.25  Find  an  f  so  that  V/  =  (ycosx,y  sinx),  or  explain  why  there  is  no  such  f. 
Exercise  16.3.26  Find  an  f  so  that  V/  =  (ycosx,  sinx),  or  explain  why  there  is  no  such  f. 
Exercise  16.3.27  Find  an  f  so  that  Vf  =  (x2y3,xy4),  or  explain  why  there  is  no  such  f. 
Exercise  16.3.28  Find  an  f  so  that  V/  =  (yz,xz,xy),  or  explain  why  there  is  no  such  f. 


Exercise  16.3.29  Evaluate  /  ( 1  Ox4  —  2xy3 )  dx  —  3 x2y2  dv  where  C  is  the  part  of  the  curve  x5  —  5x2y2 

Jc 

lx 2  =  0  from  (0,0)  to  (3,2). 

Exercise  16.3.30  Let  F  =  (yz„  xz„  xy) .  Find  the  work  done  by  this  force  field  on  an  object  that  moves  from 
(1,0,2)  to  (1,2,3). 

Exercise  16.3.31  Let  F  —  (ey,xey  +  sinz,ycosz).  Find  the  work  done  by  this  force  field  on  an  object  that 
moves  from  (0,0,0)  to  (1,  — 1,3). 


Exercise  16.3.32  Let 


—x 


-y 


— z 


(x2  +y2  +z2)3/2  ’  (x2  +y2  +z2)3/2  ’  (x2  +y2  +  z2)3/2  / 
Find  the  work  done  by  this  force  field  on  an  object  that  moves  from  (1, 1, 1)  to  (4,5,6). 


16.4  Green’s  Theorem 


We  now  come  to  the  first  of  three  important  theorems  that  extend  the  Fundamental  Theorem  of  Calculus 
to  higher  dimensions.  (The  Fundamental  Theorem  of  Line  Integrals  has  already  done  this  in  one  way,  but 
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in  that  case  we  were  still  dealing  with  an  essentially  one-dimensional  integral.)  They  all  share  with  the 
Fundamental  Theorem  the  following  rather  vague  description:  To  compute  a  certain  sort  of  integral  over 
a  region,  we  may  do  a  computation  on  the  boundary  of  the  region  that  involves  one  fewer  integrations. 

Note  that  this  does  indeed  describe  the  Fundamental  Theorem  of  Calculus  and  the  Fundamental  Theo¬ 
rem  of  Line  Integrals:  to  compute  a  single  integral  over  an  interval,  we  do  a  computation  on  the  boundary 
(the  endpoints)  that  involves  one  fewer  integrations,  namely,  no  integrations  at  all. 


Theorem  16.12:  Green’s  Theorem 


If  the  vector  held  f=  {fi,fi)  and  the  region  D  are  sufficiently  nice,  and  if  C  is  the  boundary  of  D 
(C  is  a  closed  curve),  then 

ff  -pL-~P~dA^  [  f\dx  +  f@dy, 

JJd  ox  dy  Jc 

provided  the  integration  on  the  right  is  done  counter-clockwise  around  C. 


The  proof  of  Green’s  Theorem  will  follow  a  discussion  and  several  examples. 

To  indicate  that  an  integral  /  is  being  done  over  a  closed  curve  in  the  counter-clockwise  direction,  we 

Jc 

usually  write  j)  .  We  also  use  the  notation  dD  to  mean  the  boundary  of  D  oriented  in  the  counterclockwise 

direction.  With  this  notation,  f  —  . 

Jc  J  dD 

We  already  know  one  case,  not  particularly  interesting,  in  which  this  theorem  is  true:  If  f  is  conser¬ 


vative,  we  know  that  the  integral  j)  f  •  dr  —  0,  because  any  integral  of  a  conservative  vector  field  around 

a  closed  curve  is  zero.  We  also  know  in  this  case  that  dfi/dy  —  d/2/dv,  so  the  double  integral  in  the 
theorem  is  simply  the  integral  of  the  zero  function,  namely,  0.  So  in  the  case  that  f  is  conservative,  the 
theorem  says  simply  that  0  =  0. 


Example  16.13: 


We  illustrate  the  theorem  by  computing  both  sides  of 

/  x3 dx  +  xy2 dy  —  //  y  —  0dA, 
J  9d  "  JJd 

where  D  is  the  triangular  region  with  corners  (0,0),  (1,0),  (0,-1). 


Solution.  Starting  with  the  double  integral: 


(-1+*)3  ,  _  (-1+*)4 
Jo  3  12 


1 

12' 


There  is  no  single  formula  to  describe  the  boundary  of  D,  so  to  compute  the  left  side  directly  we 
need  to  compute  three  separate  integrals  corresponding  to  the  three  sides  of  the  triangle,  and  each  of  these 
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integrals  we  break  into  two  integrals,  the  “r/x”  part  and  the  “'dy”  part.  The  three  sides  are  described  by 
y  =  0,  y  —  —  1+ jc,  and  x  —  0.  The  integrals  are  then 


xidx+xydy  =  /  x3dx+  /  0 dy+  /  xidx  + 


IdD 


10 


to 


'1 


/  0 


1  n  1  l  n  n  1 

=  7  +  °-7  +  -7^+0  +  0=— — . 

4  4  12  12 


rO  r  —  \ 

(y  +  1  )y2 dy  +  /  0 dx+  /  0 dy 

Jo  Jo 


Alternately,  we  could  describe  the  three  sides  in  vector  form  as  (t, 0)  for  t  from  0  to  1,  (t  +  l, t)  for  t 
from  0  to  —1,  and  (0,  —  1  4- 1)  for  t  from  0  to  1.  Note  that  in  each  case,  as  t  ranges  from  lower  to  upper 
bound,  we  follow  the  corresponding  side  in  the  correct  direction.  Now 


x  dx  +  xy  dy 


t2  T  t  •  0  dt  4- 


r— t 


(t  -|-  l)^  -|-  (t  -|-  l)t2r/f  T-  /  0t0( — 14 -t^dt 


-l 


f- 1 


1 


t  dt  {t  -\- 1)'  4~  (t  T  1  — - . 

Jo  12 


X 


In  this  case,  none  of  the  integrations  are  difficult,  but  the  second  approach  is  somewhat  tedious  because 
of  the  necessity  to  set  up  three  different  integrals.  In  different  circumstances,  either  of  the  integrals,  the 
single  or  the  double,  might  be  easier  to  compute.  Sometimes  it  is  worthwhile  to  turn  a  single  integral  into 
the  corresponding  double  integral,  sometimes  exactly  the  opposite  approach  is  best. 


Here  is  a  clever  use  of  Green’s  Theorem:  We  know  that  areas  can  be  computed  using  double  integrals, 
namely, 


computes  the  area  of  region  D.  If  we  can  find  /i  and  fj  so  that  r)  f2 / dx  —  r)  j\  /dy  —  1,  then  the  area  is  also 


IdD 


f\  dx  +  f2  dy. 


It  is  quite  easy  to  do  this:  fi  —  0,f2  —  x  works,  as  do  f\  —  —y,f2  —  0  and  f\  —  —  y/2,f2  —  x/2. 


Solution.  We  find  the  area  of  the  interior  of  the  ellipse  via  Green’s  theorem.  To  do  this  we  need  a  vector 
equation  for  the  boundary;  one  such  equation  is  (acos/,fisint),  as  t  ranges  from  0  to  2k.  We  can  easily 
verify  this  by  substitution: 


x2  y2 

2  "TV 
u 


b2 


a 2  cos2 1  b2  sin2 1 

- ^ - b 


b2 


—  cos- 1 4~  sin- 1  =  1. 
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Let’s  consider  the  three  possibilities  for  f\  and  f2  above:  Using  0  and  x  gives 


/  f  i  7 

fOdx  +  xdy  —  /  acos(t)bcos(t)dt  —  /  abcos  (t)dt. 

Jc  '  Jo  Jo 

Using  —  y  and  0  gives 

f  flK  [2K  2 

f—ydx  +  Ody  —  /  —  frsin(t)(  —  asin(f))  dt  —  /  absm~(t)dt. 

Jc  '  Jo  Jo 


Finally,  using  —  y/2  andx/2  gives 


r  -  dx+  -  dy  — 
c  2  2 


'o 


rl7t  bsin(t)  .  .  .  ..  ,  acos(t)  .  ...  , 

— - — (-asm(t))dt-\ - — — ~(bcos(t))dt 

[2n  ab  sin2 1  ab  cos2 1  ,  f2n  ab 

/  — ^ - 1 - x - dt  —  /  —dt  —  TZab. 

I  o  2  2  Jo  2 


The  first  two  integrals  are  not  particularly  difficult,  but  the  third  is  very  easy,  though  the  choice  of  f\  and 
f2  seems  more  complicated.  X 


Now  we  look  at  the  proof  of  Green’s  Theorem. 

Proof.  We  cannot  here  prove  Green’s  Theorem  in  general,  but  we  can  do  a  special  case.  We  seek  to  prove 
that  for  a  vector  field  f  = 

<f  f\dx  +  f2dy=  I j  ^-^y-dA. 

Jc  JJd  ox  dy 

It  is  sufficient  to  show  that 

/  fi  dx  —  I f  — dA  and  /  fody  —  I [  dA, 

Jc  JJd  dy  Jc  •  JJD  dx 

which  we  can  do  if  we  can  compute  the  double  integral  in  both  possible  ways,  that  is,  using  dA  —  dydx 
and  dA  —  dxdy. 

For  the  first  equation,  we  start  with 

//  d-J-dA=  f  riX)3-Adydx=  f '’flO,g2(x))-Mx,gl(x))dx. 

JJd  dy  Ja  Jgi(x)  dy  Ja 
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Here  we  have  simply  used  the  ordinary  Fundamental  Theorem  of  Calculus,  since  for  the  inner  integral 
we  are  integrating  a  derivative  with  respect  to  y:  an  antiderivative  of  dj\  / dy  with  respect  to  y  is  simply 
/i  (x,y),  and  then  we  substitute  g\  and  go  for  y  and  subtract. 

Now  we  need  to  manipulate  §cf\dx.  The  boundary  of  region  D  consists  of  4  parts,  given  by  the 
equations  y  =  gi(jt),  x  —  b,  y  —  g2(x),  and  x  —  a.  On  the  portions  x—b  and  x  —  a,  dx  =  0 dt,  so  the 
corresponding  integrals  are  zero.  For  the  other  two  portions,  we  use  the  parametric  forms  x  —  t,  y  —  g\ (t), 
a  <t  <b,  and  x  =  t,  y  =  g2{t),  letting  t  range  from  b  to  a,  since  we  are  integrating  counter-clockwise 
around  the  boundary.  The  resulting  integrals  give  us 

r  rb  ra  rb  rb 

ffidx=  fi(t,gi(t))dt+  fi(t,gi(t))dt  —  /  fi{t,gi(t))dt-  fi(t,g2(t))dt 

JC  Ja  Jb  Ja  Ja 

=  [  fi(t,gi(t))-fi(t,g2(t))dt 


which  is  the  result  of  the  double  integral  times  —1,  as  desired. 

The  equation  involving  f2  is  essentially  the  same,  and  left  as  an  exercise.  4k 

We  can  now  rewrite  Green’s  Theorem  using  the  concepts  of  divergence  and  curl;  these  rewritten  ver¬ 
sions  in  turn  are  closer  to  some  later  theorems  we  will  see. 

Suppose  we  write  a  two  dimensional  vector  held  in  the  form  f  =  (/i,/2,0),  where  f\  and  f2  are  func¬ 
tions  of  x  and  y.  Then 

i  j  k 


Vxf  = 


d_  d_ 
dx  dy 

h  h 


d_ 

dz 

o 


=  (0,0. 


df2  dfi 
dx  dy 


). 


and  so  (V  x  f)  •  k  =  (0,0,  ■  (0,0, 1)  =  So  Green’s  Theorem  says 


IdD 


f- dr —  /  f\dx  +  f2dy  — 


IdD 


dy 

df2  df\ 

D  dx  dy 


dA—  //  (V  x  f)  •  kdA. 


D 


(16.2) 


Roughly  speaking,  the  right-most  integral  adds  up  the  curl  (tendency  to  swirl)  at  each  point  in  the  region; 
the  left-most  integral  adds  up  the  tangential  components  of  the  vector  held  around  the  entire  boundary. 
Green’s  Theorem  says  these  are  equal,  or  roughly,  that  the  sum  of  the  “microscopic”  swirls  over  the  region 
is  the  same  as  the  “macroscopic”  swirl  around  the  boundary. 

Next,  suppose  that  the  boundary  dD  has  a  vector  form  r(/),  so  that  r'(/)  is  tangent  to  the  boundary, 
and  T  =  r/(t)/|r/(t)|  is  the  usual  unit  tangent  vector.  Writing  r  =  (x(t),y(t))  we  get 

rp  (^/> 


and  then 


N 


|r'(0l 

M 

|r'(0l 


is  a  unit  vector  perpendicular  to  T,  that  is,  a  unit  normal  to  the  boundary.  Now 

(/,-A 


IdD 


f-N  ds=  (fuf2) 


dD 


|r'(0l 


r'(t)\dt—  /  f\y'dt  —  f2x'dt 


IdD 
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=  /  fi  dy  f2dx 
JdD 


-f2dx  +  f\  dy. 


So  far,  we’ve  just  rewritten  the  original  integral  using  alternate  notation.  The  last  integral  looks  just  like 
the  left  side  of  Green’s  Theorem  (16.12)  except  that  f\  and  f2  have  traded  places  and  f2  has  acquired  a 
negative  sign.  Then  applying  Green’s  Theorem  we  get 

f  -f2dx  +  f\dy=  [[  ^p-  +  ?fidA=  [[  V-f dA. 

JdD  JJd  dx  dy  JJD 

Summarizing  the  long  string  of  equalities, 

[  f-N  ds=  [[  V-f  dA.  (16.3) 

JdD  JJd 


Roughly  speaking,  the  hrst  integral  adds  up  the  flow  across  the  boundary  of  the  region,  from  inside  to  out, 
and  the  second  sums  the  divergence  (tendency  to  spread)  at  each  point  in  the  interior.  The  theorem  roughly 
says  that  the  sum  of  the  “microscopic”  spreads  is  the  same  as  the  total  spread  across  the  boundary  and  out 
of  the  region. 


Exercises  for  16.4 


Exercise  16.4.1 

Exercise  16.4.2 

Exercise  16.4.3 

Exercise  16.4.4 

Exercise  16.4.5 

Exercise  16.4.6 

Exercise  16.4.7 

Exercise  16.4.8 

Exercise  16.4.9 


Compute 


Compute 


Compute 


Compute 


Compute 


Compute 


Compute 


Compute 


Compute 


2ydx+3xdy,  where  D  is  described  by  0  <  x  <  1,  0  <  y  <  1. 
xydx  +  xydy,  where  D  is  described  by  0  <x  <  1,  0  <  y  <  1. 
e2x  1 3-v  dx  +  exy  dy,  where  D  is  described  by  —2  <x  <2,  — 1  <  y  <  1. 
ycosxdx  +  ysinxdy,  where  D  is  described  by  0  <x<tz/2,  1  <  y  <  2. 

9  9 

x  y  dx  +  vy"  dy,  where  D  is  described  by  0  <x  <  1,  0  <  y  <  x. 
x^/ydx-h  \Jx  +  y  dy,  where  D  is  described  by  1  <  x  <  2,  2x  <  y  <  4. 
(x/y)  dx  +  (2  +  3x)  dy,  where  D  is  described  by  1  <x  <2,  1  <  y  <  x2. 

siny dx+  sin.r dy,  where  D  is  described  by  0  <  x  <  n/2,  x  <  y  <  k/2. 

2 

xln  ydx,  where  D  is  described  by  1  <  x  <  2,  ex  <  y  <  ex  . 
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Exercise  16.4.10  Compute 


Vi  +  x2dy,  where  D  is  described  by  —  1  <  x  <  1,  x2  <  y  <  1. 


Exercise  16.4.11  Compute  /  x2ydx  —  xy1  dy,  where  D  is  described  by  x2  +  v2  <  1. 

JdD 

Exercise  16.4.12  Compute  /  v3  dx  +  2x3  dy,  where  D  is  described  by  x2  +y2  <  4. 

JdD' 


Exercise  16.4.13  Evaluate  j)  (y  —  sin(.r))  dx  +  cos(.r)  dy,  where  C  is  the  boundary  of  the  triangle  with 
vertices  (0,0),  (1,0),  and  (1,2)  oriented  counter-clockwise. 

Exercise  16.4.14  Finish  our  proof  of  Green’ s  Theorem  by  showing  that  <fi  fi_  d  v  =  ( f  -V  dA . 

Jc  '  JJd  ox 


16.5  The  Divergence  Theorem 


The  third  version  of  Green’s  Theorem  (Equation  16.3)  we  saw  was: 


/  f-N  ds=  V-f  dA. 

IdD  JJd 


With  minor  changes  this  turns  into  another  equation,  the  Divergence  Theorem: 


Theorem  16.15:  Divergence  Theorem 


Under  suitable  conditions,  if  E  is  a  region  of  three  dimensional  space  and  D  is  its  boundary  surface, 
oriented  outward,  then 


f  NdS  = 


D 


V-fdV. 


Proof.  Again  this  theorem  is  too  difficult  to  prove  here,  but  a  special  case  is  easier.  In  the  proof  of  a 
special  case  of  Green’s  Theorem,  we  needed  to  know  that  we  could  describe  the  region  of  integration  in 
both  possible  orders,  so  that  we  could  set  up  one  double  integral  using  dxdy  and  another  using  dydx. 
Similarly  here,  we  need  to  be  able  to  describe  the  three-dimensional  region  E  in  different  ways. 

We  start  by  rewriting  the  triple  integral: 


V-f  dV 


dfi  df2  df3 
dx  dy  dz 


)dV 


dV. 


The  double  integral  may  be  rewritten: 


ff  f-N dS=  [[  (/ii  +  /2j  +  /3k)-NJS=  [[  fii-NdS+  [[  /2j-NJS+  [[  f3k-NdS. 
JJd  JJd  JJd  JJd  JJd 
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To  prove  that  these  give  the  same  value  it  is  sufficient  to  prove  that 


IIL7Jv’ 

(16.4) 

jjDh\  •  N dS  =  1/1^  dV,  and 

(16.5) 

iidv ■ 

(16.6) 

Not  surprisingly,  these  are  all  pretty  much  the  same;  we’ll  do  the  first  one. 

We  set  the  triple  integral  up  with  dx  innermost: 

Jill 7dv  =  ILCm  3~£dxdA  =  JLMgliy-M -Msdy,z),y,z)dA, 

where  B  is  the  region  in  the  y-z  plane  over  which  we  integrate.  The  boundary  surface  of  E  consists  of  a 
“top”  x  —  g2{y,z),  a  “bottom”  x  —  gi(y,z),  and  a  “wrap-around  side”  that  is  vertical  to  the  y-z  plane.  To 
integrate  over  the  entire  boundary  surface,  we  can  integrate  over  each  of  these  (top,  bottom,  side)  and  add 
the  results.  Over  the  side  surface,  the  vector  N  is  perpendicular  to  the  vector  i,  so 


ff  /ii  ■  Nr/5 

J  J  side 


o. 


Thus,  we  are  left  with  just  the  surface  integral  over  the  top  plus  the  surface  integral  over  the  bottom.  For 
the  top,  we  use  the  vector  function  r  =  (g2(y,z),y,z)  which  gives  rv  xr,=  ( 1 ,  —giy,  —giz)\  the  dot  product 
of  this  withi  =  (1,0,0)  is  1.  Then 


ff  fii  -NdS  —  ff  f\{g2{y,z),y,z)dA. 

JJ  top  JJB 


In  almost  identical  fashion  we  get 


ff  /ii-Nr/S=- ff  fi(gi(y,z),y,z)dA , 

J  J  bottom  JJB 

where  the  negative  sign  is  needed  to  make  N  point  in  the  negative  x  direction.  Now 
ff  fii-NdS=  ff  fi(g2(y,z),y,z)dA-  ff  fi(gi(y,z),y,z)dA, 

JJD  JJB  JJB 

which  is  the  same  as  the  value  of  the  triple  integral  above.  4k 

It  is  worth  noting  that  this  theorem  is  also  referred  to  as  Gauss  ’  Theorem.  We  now  compute  an  example. 


Example  16.16: 


Let  f=  (2x,3 y,z2),  and  consider  the  three-dimensional  volume  inside  the  cube  with  faces  parallel 
to  the  principal  planes  and  opposite  corners  at  (0,0,0)  and  (1,1,1).  Compute  the  two  integrals  of 
the  divergence  theorem. 
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Solution.  The  triple  integral  is  the  easier  of  the  two: 


"1  rl 


0  JO  JO 


2  +  3  +  2  zdxdydz 


6. 


The  surface  integral  must  be  separated  into  six  parts,  one  for  each  face  of  the  cube.  One  face  is  z  —  0  or 
r  =  (u,v, 0),  0  <  u,v  <  1.  Then  ru  —  (1,0,0),  rv  =  (0, 1,0),  and  ru  x  rv  =  (0,0, 1).  We  need  this  to  be 
oriented  downward  (out  of  the  cube),  so  we  use  (0,0,  —  1)  and  the  corresponding  integral  is 


1  rl 


1  rl 


—zzdudv  — 


0  JO 


Odudv  —  0. 


0  .70 


Another  face  is  y  —  1  or  r  =  (u,  l,v).  Then  ru  —  (1,0,0),  rv  =  (0,0, 1),  and  ru  x  rv  =  (0,  — 1,0).  We 
need  a  normal  in  the  positive  y  direction,  so  we  convert  this  to  (0, 1,0),  and  the  corresponding  integral  is 


/  /  3 ydudv—  /  /  3 dudv 

Jo  Jo  '  Jo  Jo 


—  3. 


The  remaining  four  integrals  have  values  0,  0,  2,  and  1,  and  the  sum  of  these  is  6,  in  agreement  with 
the  triple  integral.  X 


Example  16.17: 


Let  f=  ( x3,y3,z 2),  and  consider  the  cylindrical  volume  x2  +  v2  <  9,  0  <  z  <  2.  Compute  the  two 
integrals  of  the  divergence  theorem. 


Solution  The  triple  integral  (using  cylindrical  coordinates)  is 

rlK  r'i  C 

Jo 


:  r3  r2 

/  /  (3r2  +  2z)rdzdrd 0  =  219k. 

Jo  Jo 


For  the  surface  we  need  three  integrals.  The  top  of  the  cylinder  can  be  represented  by  r  =  (vcos  u,  v  sin  u,  2) ; 
ru  x  rv  =  (0,0,  —  v),  which  points  down  into  the  cylinder,  so  we  convert  it  to  (0,0,  v).  Then 


c2ti  r3 


(v3cos3  «,v3  sin3M,4)  •  (0,0,  v)  dvdu  = 


r2n  z-3 


to  Jo 


4vdvdu  —  3671. 


Jo  Jo 


The  bottom  is  r  =  (vcosw,vsinM,0);  ru  x  rv  =  (0,0,  —  v)  and 
r2%  r3 


fZ7Z  3  3  3  3  (lK  f 3 

/  /  (v  cos  m,v  sin  m,0)  •  (0,0,  — v)  JvJm  =  /  /  0dvdu  —  0. 

Jo  Jo  Jo  Jo 

The  side  of  the  cylinder  is  r  =  (3cosw,3sinw,v);  ru  x  rv  =  (3cosw,3sinw,0)  which  does  point  outward, 


so 


I  r  2 

/  /  (27cos3f/,27sin3M,v2)  •  {3cosu,3smu,Q)  dvdu 

Jo  Jo 


c2k  r 2 


8 1  cos4  u  +  81  si  mudvdu  =  243 Jt. 


to  Jo 


The  total  surface  integral  is  thus  36^  +  0  +  2437T  =  219k. 


X 
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Exercises  for  16.5 


Exercise  16.5.1  Using  f—  (3 x,y3,  —2 z2)  and  the  region  bounded  by  x2  +y2  =  9,  z  —  0,  and  z—  5,  compute 
both  integrals  from  the  Divergence  Theorem. 


Exercise  16.5.2  Let  E  be  the  volume  described  by  0<x<  a,  0<y<b,  0<z<c,  and  f  —  (x2,y2,z2). 


Compute  //  f-NdS. 

JJdE 

Exercise  16.5.3  Let  E  be  the  volume  described  by  0<x<l,  0<j<1,  0<z<l,  andf  —  (2 xy, 3.r y,zex+y). 


Compute  //  f-NdS. 

JJdE 

Exercise  16.5.4  Let  E  be  the  volume  described  by  9  <x  <\,  0  <y  <x,  0  <  z<  x  +  y,  andf  =  (x,2y,3z). 


Compute  I j  f-NdS. 


Exercise  16.5.5  Let  E  be  the  volume  described  bv  xf  +  y2  +z~  <  4,  andf  —  (x3,_y3,z3).  Compute  //  /• 

JJdE 

NdS. 

Exercise  16.5.6  Lei  E  be  the  hemisphere  described  by  0  <  z  <  y/l-x2-y2,  andf  —  ( \/x2+y2+z2,  \J x2  +y2  +z2, 


Compute  //  f-NclS. 

JJdE 

Exercise  16.5.7  Let  E  be  the  volume  described  by  x2  -\-y2  <  1,  0  <  z  <  4,  and /=  (xy2,yz,x2z).  Compute 

[f  f-NdS. 

JJdE 

Exercise  16.5.8  Let  E  be  the  solid  cone  above  the  x-y  plane  and  inside  z  =  1  —  \J x2  +  y2,  and  f  — 
(j:cos2z,_ysin2z,  ^?T7z).  Compute  JJj-  NdS. 

Exercise  16.5.9  Prove  the  other  two  equations  in  the  display  16.4. 

Exercise  16.5.10  Suppose  D  is  a  closed  surface,  and  that  D  and  F  are  sufficiently  nice.  Show  that 

JJ  (V  x  f)  -  NdS  =  0 

where  N  is  the  outward  pointing  unit  normal. 

Exercise  16.5.11  Suppose  D  is  a  closed  surface,  D  is  sufficiently  nice,  and  F  —  (a,b,c)  is  a  constant 
vector  field.  Show  that 


D 


f-NdS  =  0 


where  N  is  the  outward  pointing  unit  normal. 
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Exercise  16.5.12  We  know >  that  the  volume  of  a  region  E  may  often  be  computed  as 


dxdydz.  Show 


that  this  volume  may  also  be  computed  as 
to  dE. 


1 

3 


(. x,y,z )  ■ NdS  where  N  is  the  outward  pointing  unit  normal 


16.6  Vector  Functions  for  Surfaces 


We  have  dealt  extensively  with  vector  equations  for  curves,  r(f)  =  (x(t),y(t),z(t)).  A  similar  technique 
can  be  used  to  represent  surfaces  in  a  way  that  is  more  general  than  the  equations  for  surfaces  we  have 
used  so  far.  Recall  that  when  we  use  r(t)  to  represent  a  curve,  we  imagine  the  vector  r(t)  with  its  tail  at 
the  origin,  and  then  we  follow  the  head  of  the  arrow  as  t  changes.  The  vector  “draws”  the  curve  through 
space  as  t  varies. 

Suppose  we  instead  have  a  vector  function  of  two  variables, 

r  (u,v)  =  {x(u,v),y(u,v),z(u,v)). 

As  both  u  and  v  vary,  we  again  imagine  the  vector  r(«,v)  with  its  tail  at  the  origin,  and  its  head  sweeps 
out  a  surface  in  space.  A  useful  analogy  is  the  technology  of  CRT  video  screens,  in  which  an  electron 
gun  fires  electrons  in  the  direction  of  the  screen.  The  gun’s  direction  sweeps  horizontally  and  vertically 
to  “paint”  the  screen  with  the  desired  image.  In  practice,  the  gun  moves  horizontally  through  an  entire 
line,  then  moves  vertically  to  the  next  line  and  repeats  the  operation.  In  the  same  way,  it  can  be  useful  to 
imagine  fixing  a  value  of  v  and  letting  r (w,  v)  sweep  out  a  curve  as  u  changes.  Then  v  can  change  a  bit,  and 
r (w,v)  sweeps  out  a  new  curve  very  close  to  the  first.  Put  enough  of  these  curves  together  and  they  form  a 
surface. 


Example  16.18: 


Consider  the  function  r(u,v)  —  (vcosw,  vsinw,  v).  For  a  fixed  value  of  v,  as  u  varies  from  0  to  In, 
this  traces  a  circle  of  radius  v  at  height  v  above  the  x-y  plane.  Put  lots  and  lots  of  these  together, and 
they  form  a  cone,  as  in  Figure  1 6.4. 


Figure  16.4:  Tracing  a  surface. 
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Recall  that  from  the  vector  equation  of  the  curve  we  can  compute  the  unit  tangent  T,  the  unit  normal  N, 
and  the  binormal  vector  B  =  TxN;  you  may  want  to  review  Section  15.3.  The  binormal  is  perpendicular 
to  both  T  and  N;  one  way  to  interpret  this  is  that  N  and  B  define  a  plane  perpendicular  to  T,  that  is, 
perpendicular  to  the  curve;  since  N  and  B  are  perpendicular  to  each  other,  they  can  function  just  as  i  and  j 
do  for  the  x-y  plane. 


Example  16.19: 


The  curve  given  by 

r=  ((2  +  cos(3m/2))cosw,  (2  +  cos(3w/2))  sinw,sin(3n/2)) 

is  called  a  trefoil  knot.  Then  c(v)  =  iVcos  v  +  B  sinv  is  a  vector  equation  for  a  unit  circle  in  a  plane 
perpendicular  to  the  curve  described  by  r,  except  that  the  usual  interpretation  of  c  would  put  its 
center  at  the  origin.  We  can  fix  that  simply  by  adding  c  to  the  original  r:  let  f  —  r{u )  +  c(v).  For 
a  fixed  u  this  draws  a  circle  around  the  point  r(« );  as  u  varies  we  get  a  sequence  of  such  circles 
around  the  curve  r,  that  is,  a  tube  of  radius  1  with  r  at  its  center.  We  can  easily  change  the  radius; 
for  example  r(u)  +  ac(v)  gives  the  tube  radius  a;  we  can  make  the  radius  vary  as  we  move  along  the 
curve  with  r(u )  +  g(u)c(v),  where  g(u)  is  a  function  of  u. 


As  shown  in  Figure  16.5,  it  is  hard  to  see  that  the  plain  knot  is  knotted;  the  tube  makes  the  structure 
apparent.  Of  course,  there  is  nothing  special  about  the  trefoil  knot  in  this  example;  we  can  put  a  tube 
around  (almost)  any  curve  in  the  same  way. 


Figure  16.5:  Tubes  around  a  trefoil  knot,  with  radius  1  / 2  and  3  cos (w)  /4. 

We  have  previously  examined  surfaces  given  in  the  form  f(x,y).  It  is  sometimes  useful  to  represent 
such  surfaces  in  the  more  general  vector  form,  which  is  quite  easy:  r(«,v)  =  (u.,v,f(u,v)).  The  names  of 
the  variables  are  not  important  of  course;  instead  of  disguising  x  and  y,  we  could  simply  write  r(;c,y)  = 
(x,y,f(x,y)). 

We  have  also  previously  dealt  with  surfaces  that  are  not  functions  of  x  and  y;  many  of  these  are  easy 
to  represent  in  vector  form.  One  common  type  of  surface  that  cannot  be  represented  as  z  —  f(x,y )  is  a 
surface  given  by  an  equation  involving  only  x  and  y.  For  example,  x  +  y  —  1  and  y  —  x2  are  “vertical” 
surfaces.  For  every  point  (jc,y)  in  the  plane  that  satisfies  the  equation,  the  point  (x,y,z)  is  on  the  surface, 
for  every  value  of  z.  Thus,  a  corresponding  vector  form  for  the  surface  is  something  like  (f(u),g(u),v); 
for  example,  x  +  y  —  1  becomes  (w,  1  —  u,  v)  and  y  —  x2  becomes  (u,  u2,v). 
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Yet  another  sort  of  example  is  the  sphere,  say  x2  +  y2  +  z2  —  1 .  This  cannot  be  written  in  the  form 
z  =  f(x,y),  but  it  is  easy  to  write  in  vector  form;  indeed  this  particular  surface  is  much  like  the  cone, 
since  it  has  circular  cross-sections,  or  we  can  think  of  it  as  a  tube  around  a  portion  of  the  z-axis,  with 
a  radius  that  varies  depending  on  where  along  the  axis  we  are.  One  vector  expression  for  the  sphere  is 
(v/r^  V 2  cos  u,  vT3  v2sinw,v) — this  emphasizes  the  tube  structure,  as  it  is  naturally  viewed  as  drawing  a 
circle  of  radius  V 1  —  v2  around  the  z-axis  at  height  v.  We  could  also  take  a  cue  from  spherical  coordinates, 
and  write  (sin «  cos  v,  sin  w  sin  v,  cos  w),  where  in  effect  u  and  v  are  (j>  and  0  in  disguise. 

It  is  quite  simple  to  use  a  computer  program  to  plot  any  surface  for  which  you  have  a  vector  represen¬ 
tation.  Using  different  vector  functions  sometimes  gives  different  looking  plots,  because  the  computer  in 
effect  draws  the  surface  by  holding  one  variable  constant  and  then  the  other. 

Here’s  a  simple  but  striking  example:  the  plane  x  +  y  +  z  =  1  can  be  represented  quite  naturally  as 
(«,  v,  1  —  u  —  v).  However  we  could  also  think  of  painting  the  same  plane  by  choosing  a  particular  point 
on  the  plane,  say  (1,0,0),  and  then  drawing  circles  or  ellipses  (or  any  of  a  number  of  other  curves)  as  if 
that  point  were  the  origin  in  the  plane.  For  example,  ( 1  —  v  cos  u  —  v  sin  u,  v  sin  u,  v  cos  u)  is  one  such  vector 
function.  Note  that  while  it  may  not  be  obvious  where  this  came  from,  it  is  quite  easy  to  see  that  the  sum 
of  the  x,  y,  and  z  components  of  the  vector  is  always  1.  Computer  renderings  of  the  plane  using  these  two 
functions  are  shown  in  Figure  16.6. 


i 


Figure  16.6:  Two  representations  of  the  same  plane. 

Suppose  we  know  that  a  plane  contains  a  particular  point  (xo,yo>  zo)  and  that  two  vectors  u  =  (no,  u\ ,  ui) 
and  v  =  (vo,vi,  V2)  are  parallel  to  the  plane  but  not  to  each  other.  We  know  how  to  get  an  equation  for  the 
plane  in  the  form  ax  +  by  +  cz  —  d,  by  first  computing  u  x  v.  It’s  even  easier  to  get  a  vector  equation: 

r(u,v)  =  {xo,yo,zo)  +uu  +  v\. 

The  first  vector  gets  to  the  point  (xo,yo,Zo)  and  then  by  varying  u  and  v,  nu  + vv  gets  to  every  point  in  the 
plane. 

Returning  to  x+y  +  z  —  1,  the  points  (1,0,0),  (0, 1,0),  and  (0,0, 1)  are  all  on  the  plane.  By  subtracting 
coordinates  we  see  that  (—1,0, 1)  and  (—1, 1,0)  are  parallel  to  the  plane,  so  a  third  vector  form  for  this 
plane  is 

(1,0,0)  +  u{- 1,0,1)  +  v(— 1,1,0)  =  (1  —  u  —  v,v,u). 
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This  is  clearly  quite  similar  to  the  first  form  we  found. 

We  have  already  seen  (Section  14.4)  how  to  find  the  area  of  a  surface  when  it  is  defined  in  the  form 
f(x,y).  Finding  the  area  when  the  surface  is  given  as  a  vector  function  is  very  similar.  Looking  at  the  plots 
of  surfaces  we  have  just  seen,  it  is  evident  that  the  two  sets  of  curves  that  fill  out  the  surface  divide  it  into 
a  grid,  and  that  the  spaces  in  the  grid  are  approximately  parallelograms.  As  before  this  is  the  key:  we  can 
write  down  the  area  of  a  typical  small  parallelogram  and  add  them  all  up  with  an  integral. 

Suppose  we  want  to  approximate  the  area  of  the  surface  r(w,v)  near  r(«o,vo)-  The  functions  r(w,vo) 
and  r(«o,v)  define  two  curves  that  intersect  at  r(«o,vo).  The  derivatives  of  r  give  us  vectors  tangent  to 
these  two  curves:  ru(uo,\’o)  and  rv(wo,vo),  and  then  tu(uo,vq) du  and  rv(uo,vo) dv  are  two  small  tangent 
vectors,  whose  lengths  can  be  used  as  the  lengths  of  the  sides  of  an  approximating  parallelogram.  Finally, 
the  area  of  this  parallelogram  is  \ru  x  rv|  dudv  and  so  the  total  surface  area  is 


rJ  dudv 


(16.7) 


Solution.  We  compute  ru  —  (— vsinw,vcosw,0)  and  rv  =  (cos  u,  sin  u,  1).  The  cross  product  of  these  two 
vectors  is  (vcos  w,vsinw,  — v)  with  length  \/2v2  —  \J~2v,  and  the  surface  area  is 


X 


Exercises  for  16.6 


Exercise  16.6.1  Describe  or  sketch  the  surface  with  the  given  vector  function. 

(a)  r(u,v)  =  (6v  —  u,3  +  v,  1  —  4v) 

(b)  r(u,v)  =  (2cosw,5cos«,v) 

(c)  r(s,t)  —  {u  +  v,2u  +  v, u2  +  v2) 

(d)  r(s,t)  =  (sinw  +  cosw,w,v) 

Exercise  16.6.2  Find  the  area  of  the  portion  of  x  +  2y  +  4z 
Exercise  16.6.3  Find  the  area  of  the  portion  of2x  +  4y  +  z 


—  10  in  the  first  octant. 

—  0  inside  x2  +y2  =  1. 
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Exercise  16.6.4  Find  the  area  ofz  =  x2  Fy2  that  lies  below  z  =  1. 

Exercise  16.6.5  Find  the  area  ofz  =  \Jx2  +  v2  that  lies  below  z.  =  2. 

Exercise  16.6.6  Find  the  area  of  the  portion  of  x 2  +  y2  +  z2  =  a2  that  lies  in  the  first  octant. 

Exercise  16.6.7  Find  the  area  of  the  portion  of  x 2  +  y2  +  z2  —  a2  that  lies  above  x2  +  y2  <  b2. 

Exercise  16.6.8  Find  the  area  ofz  —  x2  —  y2  that  lies  inside  x2  Fy2  —  a2. 

Exercise  16.6.9  Find  the  area  of  x2  Fy 2  F  z2  =  a2  that  lies  above  the  interior  of  the  circle  given  in  polar 
coordinates  by  r  —  a  cos  6. 


Exercise  16.6.10  Find  the  area  of  the  cone  z.  —  k  \Jx2  F  y2  that  lies  above  the  interior  of  the  circle  given 
in  polar  coordinates  by  r  —  a  cos  0. 

Exercise  16.6.11  Find  the  area  of  the  plane  z  =  axFbyFc  that  lies  over  a  region  D  with  area  A. 

Exercise  16.6.12  Find  the  area  of  the  cone  z  =  k \Jx2  +  y2  that  lies  over  a  region  D  with  area  A. 

Exercise  16.6.13  Find  the  area  of  the  cylinder  x  Fz  —  a~  that  lies  inside  the  cylinder  x~  F  y  —  a  . 

Exercise  16.6.14  The  surface  f(x,y )  can  be  represented  with  the  vector  function  (x,y,f(x,y)).  Set  up  the 
surface  area  integral  using  this  vector  function  and  compare  to  the  integral  of  Section  14.4. 


16.7  Surface  Integrals 


In  the  integral  for  surface  area,  from  Equation  16.7, 

rb  rd 


\ru  x  rv|  dudv. 


the  integrand  |r„  x  rv|  dudv  is  the  area  of  a  tiny  parallelogram,  that  is,  a  very  small  surface  area,  so  it  is 
reasonable  to  abbreviate  it  dS;  then  a  shortened  version  of  the  integral  is 


IdS. 


D 


We  have  already  seen  that  if  D  is  a  region  in  the  plane,  the  area  of  D  may  be  computed  with 


D 


1  dA, 


so  this  is  really  quite  familiar,  but  the  dS  hides  a  little  more  detail  than  does  dA. 
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Just  as  we  can  integrate  functions  f(x,y )  over  regions  in  the  plane,  using 


JJDf(x,y)dA, 

so  we  can  compute  integrals  over  surfaces  in  space,  using 

ff  f  (x,y,z)  dS. 

In  practice  this  means  that  we  have  a  vector  function  r (u,v)  —  (x(u,v),y(u,v),z(u,v)}  for  the  surface,  and 
the  integral  we  compute  is 


b  rd 


f(x(u,v),y(u,v),z(u,v))\ru  x  rv|  dudv. 

That  is,  we  express  everything  in  terms  of  u  and  v,  and  then  we  can  do  an  ordinary  double  integral. 


Example  16.21:  Mass  and  Center  of  Mass 


Suppose  a  thin  object  occupies  the  upper  hemisphere  ofx2+y2  +  z2  —  1  and  has  density  o(x,y,z )  = 
z.  Find  the  mass  and  center  of  mass  of  the  object.  (Note  that  the  object  is  just  a  thin  shell;  it  does 
not  occupy  the  interior  of  the  hemisphere.) 


Solution.  We  write  the  hemisphere  as  r (0,0)  =  (cos  6  sin 0,  sin  6  sin 0, cos 0),  0  <  0  <  tt/2  and  0  <  0  < 
2k.  So  re  =  (—  sin0sin0,cos0sin0,O)  and  =  (cos  0  cos  0,  sin  0  cos  0,  —  sin0).  Then 

re  x  =  (—  cos0sin20,—  sin0sin20,—  cos0sin0) 


and 


| r Q  x  Tty |  =  |  sin 0|  =  sin0, 

since  we  are  interested  only  in  0  <  0  <  k/2.  Finally,  the  density  is  z 


cos  0  and  the  integral  for  mass  is 


cos  0  sin  (j)  d(f)  dO  =  n. 


By  symmetry,  the  center  of  mass  is  clearly  on  the  z-axis,  so  we  only  need  to  find  the  z-coordinate  of 
the  center  of  mass.  The  moment  around  the  x-y  plane  is 

r  2k  rn/2  r2n  t-Ti/l  2k 

/  /  z  cos  0  sin  0<i0  r/0  =  /  /  cos20sin  (jdtjdO  — 

Jo  Jo  Jo  Jo  3 

so  the  center  of  mass  is  at  (0,0, 2/3).  X 

Now  suppose  that  f  is  a  vector  field;  imagine  that  it  represents  the  velocity  of  some  fluid  at  each  point 
in  space.  We  would  like  to  measure  how  much  fluid  is  passing  through  a  surface  D,  the  flux  across  D.  As 
usual,  we  imagine  computing  the  flux  across  a  very  small  section  of  the  surface,  with  area  dS,  and  then 
adding  up  all  such  small  fluxes  over  D  with  an  integral.  Suppose  that  vector  N  is  a  unit  normal  to  the 
surface  at  a  point;  f  ■  N  is  the  scalar  projection  of  f  onto  the  direction  of  N,  so  it  measures  how  fast  the  fluid 
is  moving  across  the  surface.  In  one  unit  of  time  the  fluid  moving  across  the  surface  will  fill  a  volume  of 
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f-NdS,  which  is  therefore  the  rate  at  which  the  fluid  is  moving  across  a  small  patch  of  the  surface.  Thus, 
the  total  flux  across  D  is 


f-N  dS  = 


D 


f  d S, 


D 


defining  d S  =  N  dS. 


As  usual,  certain  conditions  must  be  met  for  this  to  work  out;  chief  among  them  is  the  nature  of  the 
surface.  As  we  integrate  over  the  surface,  we  must  choose  the  normal  vectors  N  in  such  a  way  that  they 
point  “the  same  way”  through  the  surface.  For  example,  if  the  surface  is  roughly  horizontal  in  orientation, 
we  might  want  to  measure  the  flux  in  the  “upwards”  direction,  or  if  the  surface  is  closed,  like  a  sphere, 
we  might  want  to  measure  the  flux  “outwards”  across  the  surface.  In  the  first  case  we  would  choose  N 
to  have  positive  z  component,  in  the  second  we  would  make  sure  that  N  points  away  from  the  origin. 
Unfortunately,  there  are  surfaces  that  are  not  orientable:  they  have  only  one  side,  so  that  it  is  not  possible 
to  choose  the  normal  vectors  to  point  in  the  “same  way”  through  the  surface.  The  most  famous  such 
surface  is  the  Mobius  strip  shown  in  Figure  16.7.  It  is  quite  easy  to  make  such  a  strip  with  a  piece  of  paper 
and  some  tape.  If  you  have  never  done  this,  it  is  quite  instructive;  in  particular,  you  should  draw  a  line 
down  the  center  of  the  strip  until  you  return  to  your  starting  point.  No  matter  how  unit  normal  vectors  are 
assigned  to  the  points  of  the  Mobius  strip,  there  will  be  normal  vectors  very  close  to  each  other  pointing 
in  opposite  directions. 


Figure  16.7:  A  Mobius  strip. 


Assuming  that  the  quantities  involved  are  well  behaved,  however,  the  flux  of  the  vector  field  across  the 
surface  r(w,v)  is 


f-N  dS 


D 


r  u  x  rv 
ru  x  rv 


ru  x  rv|  dA 


r  v)dA. 


In  practice,  we  may  have  to  use  rv  x  ru  or  even  something  a  bit  more  complicated  to  make  sure  that  the 
normal  vector  points  in  the  desired  direction. 


Example  16.22:  Flux 


Compute  the  flux  off =  (x,  y,  z2)  across  the  cone  z  —  \Zx2  +y2,  0  <  z  <  2,  in  the  downward  direction. 
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Solution.  We  write  the  cone  as  a  vector  function:  r  =  (vcosu,  vsinu,  v),  0  <  u  <  2tz  and  0  <  v  <  2.  Then 
ru  —  (— vsinM,vcosw,0)  and  rv  =  (cosw,sinn,  1)  and  ru  x  rv  =  (vcosw,vsinw,  —  v).  The  third  coordinate 
—v  is  negative,  which  is  exactly  what  we  desire,  that  is,  the  normal  vector  points  down  through  the  surface. 
Then 


(x,y,z2)  ■  (vcosm, vsim<,  — v)  dvdu 


r\ 

xvcosu+yvsinu  —  z  vdvdu 
v 2  cos2  u  +  v2  sin2  u  —  v3  dvdu 


9  9 

v  —v  dvdu 


8  K 

T' 


X 


Exercises  for  16.7 


Exercise  16.7.1  Find  the  center  of  mass  of  an  object  that  occupies  the  upper  hemisphere  ofx2  +y2  +  z2  =  1 
and  has  density  x 2  Fy2. 

Exercise  16.7.2  Find  the  center  of  mass  of  an  object  that  occupies  the  surface  z  —  xy,  0  <  x  <  1,  0<y<l 
and  has  density  \J  1  +x2  +y2. 

Exercise  16.7.3  Find  the  center  of  mass  of  an  object  that  occupies  the  surface  z  —  \J x2  +y2,  1  <  z  <  4 
and  has  density  x2z. 


Exercise  16.7.4  Find  the  centroid  of  the  surface  of  a  right  circular  cone  of  height  h  and  base  radius  r,  not 
including  the  base. 


Exercise  16.7.5  Evaluate 
oriented  up. 


(2,  —3,4)  •  NdS,  where  D  is  given  by  z. 


—  x2  +y2,  —  1  <  x  <  1,  —  1  <  y  <  1, 


Exercise  16.7.6  Evaluate 
oriented  up. 


(x,y,  3)  -  NdS,  where  D  is  given  by  z. 


—  3x  —  5y,  1  <  x  <  2,  0  <  y  <  2, 


Exercise  16.7.7  Evaluate 
up. 


—2)  -  NdS,  where  D  is  given  by  z 


1  —x2  —y2,  x2  +  y2  <  1,  oriented 


Exercise  16.7.8  Evaluate 
up. 


■NdS,  where  D  is  given  by  z 


xy,  0  <  x  <  1,  —x<y<x,  oriented 


Exercise  16.7.9  Evaluate 
up. 


(xz,yz,z)  -  NdS,  where  D  is  given  by  z 


=  a2  —  x2  —y2,  x2  +y2  <  b2,  oriented 
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Exercise  16.7.10  A  fluid  has  density  870  kg/m 3  and  flows  with  velocity  v  =  (z,y2,x2),  where  distances  are 
in  meters  and  the  components  of  v  are  in  meters  per  second.  Find  the  rate  of  flow  outward  through  the 
portion  of  the  cylinder  x2  +  y2  =  4,  0  <  z  <  1  for  which  y  >  0. 

Exercise  16.7.11  Gauss’s  Law  says  that  the  net  charge,  Q,  enclosed  by  a  closed  surface,  S,  is 

Q  =  £o  JJ  e-NdS 

where  e  is  an  electric  field  and  Eq  (the  permittivity  of  free  space )  is  a  known  constant;  N  is  oriented 
outward.  Use  Gauss’s  Law  to  find  the  charge  contained  in  the  cube  with  vertices  (±1,±1,±1)  if  the 
electric  field  is  e  =  ( x,y,z ). 


16.8  Stokes’  Theorem 


Recall  that  one  version  of  Green’s  Theorem  (see  Equation  16.2)  is 


Here  D  is  a  region  in  the  x-y  plane  and  k  is  a  unit  normal  to  D  at  every  point.  If  D  is  instead  an  orientable 
surface  in  space,  there  is  an  obvious  way  to  alter  this  equation,  and  it  turns  out  still  to  be  true: 


Theorem  16.23:  Stokes’  Theorem 


Provided  that  the  quantities  involved  are  sufficiently  nice,  and  in  particular  if  D  is  orientable, 

[  f  dr=  [[  (V  x  f)  •  NdS, 

JdD  JJd 

if  dD  is  oriented  counter-clockwise  relative  to  N. 


The  proof  of  Stokes’  Theorem  will  follow  a  discussion  and  several  examples  of  the  Theorem  in  use. 
Note  how  little  has  changed:  k  becomes  N,  a  unit  normal  to  the  surface,  and  dA  becomes  dS,  since  this 
is  now  a  general  surface  integral.  The  phrase  “counter-clockwise  relative  to  N”  means  that  if  we  take  the 
direction  of  N  to  be  “up”,  then  we  go  around  the  boundary  counter-clockwise  when  viewed  from  “above”. 


Example  16.24: 


Let  f  =  (y2z,x2z,xy2)  and  the  surface  D  be  x—  \J  1  —  y2  —  z1,  oriented  in  the  positive  x  direction.  It 
quickly  becomes  apparent  that  the  surface  integral  in  Stokes’  Theorem  is  intractable,  so  compute 
the  line  integral. 


Solution.  The  boundary  of  D  is  the  unit  circle  in  the  y-z  plane,  r  =  (0,  cos  u,  sin  u)  ,0<u<  lit.  The  integral 
is 

r2n  r2n 


/  (y2z,x2z,xy2)  ■  (0,  —  smu, cos u)du  —  /  0du  =  0, 
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because  x  —  0. 


X 


An  interesting  consequence  of  Stokes’  Theorem  is  that  if  D  and  E  are  two  orientable  surfaces  with  the 
same  boundary,  then 


(V  x  f)  -  NdS  = 


Sometimes  both  of  the  integrals 


(Vxf)-N  dS 


and 


{■dr 


are  difficult,  but  you  may  be  able  to  find  a  second  surface  E  so  that 


has  the  same  value  but  is  easier  to  compute. 

In  the  previous  example,  the  line  integral  was  easy  to  compute.  But  we  might  also  notice  that  another 
surface  E  with  the  same  boundary  is  the  flat  disk  y2  +  z2  <  1 . 


Example  16.25: 


Let  f  =  (y2z,x2z,xy2)  and  the  surface  E  bey2  +z2  <  1 .  Compute  the  surface  integral. 


Solution.  The  unit  normal  N  for  this  surface  is  simply  i  =  (1,0,0).  We  compute  the  curl: 

V  x  f  =  (2 xy  —  x2 , 0, 2 xz  —  2 yz) . 

Since  x  —  0  everywhere  on  the  surface, 

( V  x  f)  •  N  =  (0, 0, 2xz  -  2 yz)  ■  ( 1 , 0, 0)  =  0, 


so  the  surface  integral  is 


0  dS  =  0, 


as  before.  In  this  case,  of  course,  it  is  still  somewhat  easier  to  compute  the  line  integral,  avoiding  V  x  f 
entirely.  X 


Now  let’s  look  at  the  proof  of  Stokes’  Theorem. 

Proof.  We  can  prove  here  a  special  case  of  Stokes’  Theorem,  which  perhaps  not  too  surprisingly  uses 
Green’s  Theorem. 

Suppose  the  surface  D  of  interest  can  be  expressed  in  the  form  z  =  g(x,y),  and  let  f  =  (fi,f2,ff).  Using 
the  vector  function  r  =  (x,y,g(x,y))  for  the  surface  we  get  the  surface  integral 


Vxf-  dS  = 


D 


,dfo  df2  dfi  df3  df2  dfi 


dy 

dh 

dx 


dz  ’  dz 


gx  + 


dfi 

dz 


gx ' 


dx 

dfi 

dz 


dx  dy 


)  ■  (~gx. 


dh  ,  df2 


S.v  +  ^S.v  + 


-gy,\)dA 
dfi 


dx  dy 


dA. 
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Here  E  is  the  region  in  the  x-y  plane  directly  below  the  surface  D. 

For  the  line  integral,  we  need  a  vector  function  for  dD.  If  (x(t),y(t))  is  a  vector  function  for  dE  then 
we  may  use  r (t)  —  (x(t),y(t),g(x(t),y(t)))  to  represent  dD.  Then 


fb  dx  dy  dz  , 

L  f'di+fidt+hdtd‘ 


b  dx  dy  f  dz  dx  dzdy\ 
-  1  dt  2  dt  ^  2  \dx  dt  dy  dt ) 


dt. 


using  the  chain  rule  for  dz/dt.  Now  we  continue  to  manipulate  this: 


ch  dx  dy  ( dz  dx  - y 

n~r+  h-r+h  -y--r  +  ^r-r: 


dt 


'  dt 


dx  dt 
b 


dy  dt ) 


n+hd4dd4+(fi+hs4d  $ 


dx  J  dt 


dy  J  dt 


IdE 


dz 


'  dx 


fi  +/3^-  dx+  h  +  h^p-  dy. 


dz 


'dy 


dt 


which  now  looks  just  like  the  line  integral  of  Green’s  Theorem,  except  that  the  functions  f\  and  /2  of 
Green’s  Theorem  have  been  replaced  by  the  more  complicated  /j  +  fs(dz/dx)  and  ./i  +  h{dz./ dy).  We 
can  apply  Green’s  Theorem  to  get 


Now  we  can  use  the  chain  rule  again  to  evaluate  the  derivatives  inside  this  integral,  and  it  becomes 


dfi  ,  df2  ,  <9/3  ,  <9/3 


dx 


+ 


dz 

dfi 


8x 


+  ~dz^y  bsyx  — 


d,h  ,  <9/3  <9/1  <9/1 


+  8x  + 


dx  '  dz  dx  dy  dz  by  dy 
which  is  the  same  as  the  expression  we  obtained  for  the  surface  integral. 


Sy 


dj\  dj\  ,  <9/3  <9/3 

dy  dz 
<9/3 


8y  9"  ^  ^  8x  +  8y8x  +  fz8xy 


dA 


SxdA, 


Exercises  for  16.8 


Exercise  16.8.1  Let  f  —  ( z,x,y ).  The  plane  z  —  2x  +  2y  —  1  and  the  paraboloid  z  —  x2  +  y2  intersect  in  a 
closed  cur\’e.  Stokes’  Theorem  implies  that 


(V  xf)-NdS, 


where  the  line  integral  is  computed  over  the  intersection  C  of  the  plane  and  the  paraboloid,  and  the  two 
surface  integrals  are  computed  over  the  portions  of  the  two  surfaces  that  have  boundary  C  (provided,  of 
course,  that  the  orientations  all  match).  Compute  all  three  integrals. 


Exercise  16.8.2  Let  D  be  the  portion  of  z.  —  1 
(xy2,—x2y,xyz).  Compute  //  (V  x /)  •  NdS. 


y2  above  the  x-y  plane,  oriented  up,  and  let  f  = 


JJd 
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Exercise  16.8.3  Let  D  be  the  portion  of  z  =  2x  +  5y  inside  x2  +  y2  =  1,  oriented  up,  and  let  f  —  ( y,z ,  —x). 

Compute  /  /•  dr. 

JdD 


Exercise  16.8.4  Compute 
clockwise. 


/  9  9  9  9 

j>  x  zdx  +  3xdy—y  dz,  where  C  is  the  unit  circle  x  +  v" 


1  oriented  counter- 


Exercise  16.8.5  Let  D  be  the  portion  of  z  —  px  +  qy  +  r  over  a  region  in  the  x-y  plane  that  has  area  A, 

oriented  up,  and  let  f  =  ( ax  +  bv  +  cz,  ax  +  by  +  cz,  ax  +  by  +  cz) .  Compute  /  /•  dr. 

JdD 

Exercise  16.8.6  Let  D  be  any  surface  and  letf=  (j\  (.r),/2(.y),/3(<:))  (J'\  depends  only  on  x,  /2  only  on  y, 

and  /3  only  on  z).  Show  that  /  /•  dr  =  0. 

JdD 


Exercise  16.8.7  Show  that 
Theorem  applies. 


JcfVg  +  gVf-dr 


0,  where  r  describes  a  closed  curve  C  to  which  Stokes’ 


Selected  Exercise  Answers 


1.1.1  (a)  - 


(b)  ^Jxy 


1.1.2  a  —  2,b  —  —  |,  c  = 

1.1.3  x—  —4  and  *  =  6. 

1.1.5  (a)  (5/3,  oo) 

(b)  [1/7, 2/7] 

(c)  (— oc—3)  U  (-2,1] 

(d)  (-00,00) 

(e)  No  solution 

(f)  (-00, 1)  U  (l,oo) 

(g)  (-2,0)  U  (2-) 

(h)  [4,oo)  u{0} 

(i)  (0,i) 

(j)  (-2,-1]  U  (1,4] 

1.1.6  x  =  —  \  andx  =  —  g. 

1.1.7  (a)  (—00, —2]  U  [2,oo) 

(b)  [2,4] 

(c)  (-00,  -9/2]  U  [-l/2,oo) 


(d)  (4,oo) 

(e)  (-00,00) 

(f)  (-9, -6)  U  (4,7) 


1.2.1  (a)  (2/3)x+  (1/3) 

(b)  y  —  —2x 

(c)  y  =  (— 2/3)x+  (1/3) 


579 
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(d)  y  =  — x/3  +  17/3 

(e)  y  =  —  l/2x  +  5/2 

1.2.2  (a)  y  —  2x  +  2,2,—l 

(b)  y  =  —x  +  6,  6,  6 

(c)  y  =  x/2  +  1/2,  1/2,  -1 

(d)  y  =  3/2,  y-i intercept:  3/2,  no  x-intercept 

(e)  y  =  (— 2/3)x  —  2,  —2,  —3 

1.2.3  Yes,  the  lines  are  parallel  as  they  have  the  same  slope  of  —1/2 

1.2.4  y  =  0,  y  =  — 2x  +  2,  y  =  2x  +  2 

1.2.5  y=  (9/5)x  +  32,  (-40,-40) 

1.2.6  y  =  0.15x+  10 

1.2.7  0.03x+  1.2 

1.2.8  (a)  P  =  —0.000  lx  +  2 
(b)x  =  — 10000P  +  20000 

1.2.9  (2/25)x—  (16/5) 

1.2.10  (a)  2 

(b)  V2 

(c)  V2 

1.2.12  (a)  x2  +y2  =  9 

(b)  (x  —  5)2  +  (y  —  6)2  =  9 

(c)  (x  +  5)2  +  (y  +  6)2  =  9 

1.2.14  (a)  circle 

(b)  ellipse 

(c)  horizontal  parabola 

1.2.15  (x  +  2/7)2  +  (y  — 41/7)2  =  1300/49 
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1.3.1  2htc  —  7t/2,  any  integer  n 

1.3.2  nK  ±k/6,  any  integer  n 

1.3.4  -J 

1.3.5  sin  0  —  —x/  \Jx2  +  \  ,  cos  6  =  —  1  /  Vx2  + l 

1.3.6  —  =j-  is  the  unique  answer. 

1.3.7  (V2  +  \/6)/4 

1.3.8  -(l  +  V3)/(l-v^)=2  +  v^ 

1.3.11  t  =  k/2 


1.4.1 

(b) 


1.4.2  (a)  -13/5 

(b)  -1/2,3 

(c)  (1 ± \/l3)/2 

(d)  No  real  solutions 

(e)  y/3 


1.4.3  Counter-examples  may  vary. 

(a)  x  —  3 

(b)  x  —  h—  1 

(c)  *  =  y  =  1 

1.4.4  x  -|-  3y  —  13  —  0,  or  equivalents  such  as  y  —  —  ^x+ 

1.4.5  (-oo,0]  U(i, 3] 

1.4.6  It  is  impossible  for  both  x  —  2  and  1  —  x  to  be  non-negative  for  the  same  real  number  x. 

1.4.7  6x  +  3/z 


1.4.8  — 1/  [(2x  +  2h—  l)(2x—  1)] 
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1.4.9  -3 

1.4.10  7r/6,  5tt/ 6 

1.4.11  2k/5 

1.4.12  It  is  equal  to  2  for  all  x  larger  than  4. 

1.4.13  (x  +  2)2  +  (y  —  3)2  =  25. 

1.4.14  Centre  is  (—3,2)  and  radius  is  2. 

1.4.15  y  could  be  any  real  number  greater  than  or  equal  to  6. 

1.4.16  xV/(36z8) 

1.4.17  x  —  (y  —  2)/(3  +  Ay) 

1.4.18  Q[x)=x+\,R  =  -1 

2.1.1  (a)  (x  |  x  G  M},  i.e.,  all  x 

(b)  {x  |  x  >  3/2} 

(c)  {x  |  x  ^  — 1} 

(d)  (x  |  x  7^  1  and  x  7^  —  1 } 

(e)  (x  |  x  <  0} 

(f)  (x  |  x  G  M},  i.e.,  all  x 

(g)  {x  \  h  — r  <  x  <  h  + r} 

(h)  (x  |  x  >  0} 

(i)  (x  |  —  1  <  x  <  1 } 

(j)  \*>  1} 

(k)  (x  |  -1/3  <x<  1/3} 

(l)  (x  |  x  >  0  and  x  7^  1 } 

(m)  (x  |  x  >  0  and  x  7^  1 } 

2.1.2  A  =  x(500  -  2x),  (x  |  0  <  x  <  250} 

2.1.3  V  =  r(50  -  nr2),  {r  \  0  <  r  <  ^50 /n) 
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2.1.4 

A  =  2 Kr2  +  2000/ r,  {r  |  0  <  r  <  °o} 

2.2.3 

IV 

'tr 

* 

IV 

2.3.1 

;y  =  2T 

2.3.2 

y  =  7 

2.3.3 

y  =  2 

2.3.4 

x  7^  0 

2.6.1 

(a)  k/3 

(b) 

3k/4 

2.6.2 

(a)  n/4 

(c) 

(b) 

-k/3 

(d) 

2.6.3 

Vl  —  x2/x  with  domain  [—1,0)  U  (0, 1]. 

2.8.1 

(d) 

rl 

CO 

ri 

3/[5(x  +  3)] 

2.8.3 

6  +  h 

2.8.4 

(a)  [2,3)  U  (3,°o) 

(b) 

(— °°,  -3)  U  (-3, 3)  U  (3,°°) 

2.8.5 

{x  :  x  ^  0} 

so 

CO 

c4 

g(x)  =  (5x  +  26)/3 

t- 

QC 

ri 

(c) 

2.8.8 

/  1  (x)  —  f(x)  —  ln  (  — - —  )  and  its  domain  is  (0,°° 
\ex  —  1 ) 

2.8.9 

(a)  2  — ln3 

(b) 

1,3 

(c) 

(e2-l)2 

(d) 

3 

2.8.10  -K 

2.8.11  2tt  /  5 

2.8.12  1 


3.3.1 
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(a)  8 

(e)  -1 

(i)  3 

(b)  6 

(f)  8 

(j)  -3/2 

(c)  dne 

(g)  7 

00  6 

(d)  -2 

(h)  6 

(1)  2 

3.4.1  (a)  7 

(h)  172 

(b)  5 

(i)  0 

(c)  0 

(j)  2 

(d)  undefined 

(k)  does  not  exist 

(e)  1/6 

(1)  y/2 

(f)  o 

(m)  3 a2 

(g)  3 

(n)  512 

3.4.2  L  =  0andAf  =  1.  No. 

3.5.1  (a)  1 

(g)  00 

(m)  1/2 

(b)  1 

(h)  2/7 

(n)  5 

(c)  — oo 

(i)  2 

(o)  2V2 

(d)  1/3 

(j)  -°° 

(P)  3/2 

(e)  0 

(k)  oo 

(q)  °° 

(f)  oo 

(1)  o 

(r)  does  not  exist 

3.5.2  y  =  1  and  y  —  —  1 

3.5.3  x  =  0  and  x  —  2. 

3.5.5  y  =  x  +  4 

3.5.6  (a)  — oo 
(b)  tt/2 

(C)  0 
(d)  oo 


(e)  -5 
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(f)  1/3 

(g)  0 

3.6.1  (a)  5 

(b)  7/2 

(c)  3/4 

(d)  1 

(e)  -Vl/2 

3.6.2  7 

3.6.3  2 


3.6.5  3 


4.1.1  -5,  -2.47106145,  -2.4067927,  -2.400676,  -2.4 


4.1.2  -4/3,  -24/7,  7/24,  3/4 

4.1.3  -0.107526881,  -0.11074197,  -0.1110741,  _1  -  -+  Vr 

3(3  + Ax)  9 


4.1.4  3  +  )Ar  +  At2  ->  3 
1  +  Ar 


4.1.5  3.31,3.003001,3.0000, 
3  +  3Ax  +  Ax~  — y  3 


4.1.6  m 

4.1.9  10,  25/2,  20,  15,  25,  35. 

4.1.10  5,  4.1,  4.01,4.001,4  + A/— >4 

4.1.11  -10.29,-9.849,-9.8049, 
-9.8-4.9At  ^  -9.8 

4.2.1  (a)  — x/  V 169  —x2 

(b)  — 9.8t 

(c)  2x+  1/x2 

(d)  2 ax  +  b 
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(e)  3x2 

(f)  -2/(2x+l)3>2 

(g)  5/(1 +  2)2 

4.2.4  y  =  —  I3x  +  17 

4.2.5  -8 


4.3.1  (a)  100x" 

(g)  15x2  +  24x 

0) 

(b)  —  lOOx-101 

(h)  —  20x4  +  6x  +  1 0/x3 

(c)  — 5x^6 

(i)  -30x  +  25 

(m) 

(d)  nxK  1 

(j)  3x2  +  6x—  1 

(n) 

(e)  (3/ 4)x_1//4 

x2  +  2x  +  5 

(1)  — (9/7)x^16/7 

()  (x2  +  2x  — 3)2 

(o) 

4.3.2  y  =  13x/4  +  5 

4.3.3  y  —  24x  —  48  —  7T3 

4.3.4  — 49t/5  +  5,  —49/5 

n 

4.3.6  ^ 

k=\ 

4.3.7  x3/16  — 3x/4  +  4 


4.3.10  f  —  4(2x  —  3),  y  —  4x  —  l 


4.3.12 


3x2 

x3  —  5x  +  10 


x3(3x2  —  5) 
(x3  —  5x+  10) 


4.3.13 


2,x  +  5 

x5  —  6x 3  +  3x 2  —  7x+l 


+  5x  —  3)  (5x4  —  18x2  +  6x  —  7) 
(x5  —  6x3  +  3x2  —  lx  +  1 ) 2 


4.3.14 


x—  1250 

2(x-  625)3/2 


4.3.15 


200  -39x 
2x21  \J x  —  5 


3x2(x3  —  5x  +  10)  + 
x3(3x2  —  5) 

x4(7x2  +  30x—  15) 

— 3x2  —  20x+  15 

Xs 

3x(5x  +  8) 

_  (5x3  +  12x2  —  15)2 


4.3.16  y=  17x/4-41/4 
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4.3.17  y=  IIjc/16-  15/16 


4.3.18  13/18 

4.4.2  n/6  +  2nn,  5n/6  +  Inn,  any  integer  n 
4.5.1  4x3  —  9x2  +  x  +  7 


4.5.2  3x2  —  4x  +  2/ \fx 

4.5.3  6(x2  +  l)2x 

4.5.4  169  —x2  —  x2 / V 169  —  x2 

4.5.5  (2x  — 4)  \/25  —  x2  — 

(x2  —  4x  +  5)x/  V25  —x2 


4.5.6  —x/  V  r2  —  x 2 


4.5.7  2x~ 


4.5.8 


1 


4y/x(5  —  a/^)3/2 

4.5.9  6+  18x 


.  ,  2x  +  1  x2+x+  1 

4.5.10  - - h 


1—  x  (1— x)‘ 


4.5.11  - 1  /  a/25  -  x~  -  \J 25  -  x2/x2 


4.5.12  1  1 -  1 


169 


—  x 


4.5.13 


3x2  —  2x+  1/x2 
2a/x3  —x2  —  (1/x) 


4.5.14 

4.5.15 


300x 

(100  — x2)5/2 

1  +  3x2 
3(x+x3)2/3 
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4.5.16  4x(  x2  +  l)  + 


4x3  +  4x 


2y/l  +  (x2  +  l)2 


x2  +  l)2  +  a/1  +  (x2  +  1)2 


4.5.17  5(x  +  8)4 

4.5.18  -3(4 -x)2 

4.5.19  6x(x2  +  5)2 

4.5.20  — 12x(6  —  2x2)2 

4.5.21  24x2(l  —  4x3)-3 

4.5.22  5  +  5/x2 

4.5.23  — 8(4x  —  l)(2x2  —  x  +  3)~3 

4.5.24  l/(x+l)2 

4.5.25  3(8x  — 2)/(4x2  — 2x  +  l)2 

4.5.26  — 3x2  +  5x  —  1 


4.5.27  6x(2x - 4)3  +  6(3x2  +  1) (2x -  4)2 

4.5.28  —2/ (x—  l)2 

4.5.29  4x/(x2  +  1)2 

4.5.30  (x2  —  6x  +  7) / (x  —  3)2 

4.5.31  — 5/(3x  —  4)2 

4.5.32  60x4  +  72x3  +  1 8x2  +  1 8x  -  6 


4.5.33  (5  — 4x)/((2x+  l)2(x  — 3)2) 

4.5.34  l/(2(2  +  3x)2) 

4.5.35  56x6  +  72x5  +  1 10x4  +  100x3  +  60x2  +  28x  +  6 


4.5.36  y  =  23x/96- 29/96 

4.5.37  y  =  3  —  2x/3 
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4.5.38  y=  l3x/2-23/2 


4.5.39  y  —  2x  —  11 


4.5.40  y 


20  +  2^5  3\/5 

— /  x-\ - , 

5y4  +  y/5  5  v  4  + 


4.5.41  (f(g(l))Y  =  20 


4.5.42  g\x)  =  2 x(f(x2)  +  x2f'(x2)) 
4.6.1  21n(3)x3x2 


4.6.2 


cos  x  —  sin. x 
ex 


4.6.3  2e2x 


4.6.4  excos(ex) 

4.6.5  cos(x)esmx 


4.6.6  xsinx  cosxlnx  + 


si  n  w 


4.6.7  3xV+xV 


4.6.8  1  +2xln(2) 

4.6.9  -2.rln(3)(l/3)x2 

4.6.10  e4x(4x—  l)/x2 

4.6.11  (3x2  +  3)/(x3  +  3x) 

4.6.12  —  tan(jc) 

4.6.13  (1  —  ln(x2))/(x2  \J\n(x2)) 

4.6.14  sec(x) 

4.6.15  xC0SW  (cos(x) /x  —  cos(x)  ln(x)) 


4.6.19  e 
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4.7.1  (a)  x/y 

(b)  ~{2x  +  y)/(x  +  2y) 

(c)  (2xy-3x2-y2)/(2xy~3y2-x2) 

(d)  sin(x)  sin(y) / (cos(x)  cos(y)) 

(e)  -a fy/y/x 

(f)  (ysec2(x/y) -y2) / (xsec2(x/y)  +y2) 

(g)  (;y-cos(x  +  ;y))/(cos(x  +  .y) -x) 

(h)  -f/x2 

4.7.2  1 

4.7.3  y  =  2x  ±  6 

4.7.4  ;y  =  x/2±3 

4.7.6  (v^,2V3),  (-V3,-2V3),  (2v/3,v/3),  (-2^3, -V3) 

4.7.7  y  =  lx/V3~8/V3 

4.7.8  y=  (-y)/3x  +  yj/3xi +xJ/3>'i)/x:J/3 

4.7.9  (y  -y\)/(x-  x\ )  =  (2x3  +  2x\ yj  -  x, )  / (2y3  +  2>’|X2  +  y{ ) 

4.8.1  1 

4.9.1  (a)  4(2x  +  3) 

(b)  fx1/2 

4.9.2  3 

4.9.3  M  28*3- -A 

<b)  ~  sTA  i  +  v5)2 

r  -2  if  x  <  —2 

(c)  /'(x)  =  <  0  if  —  2  <  x  <  1 

2  ifx>l 


(d)  2x  sinxcosx  +  x2  cos2  x  —  x2  sin2  x 
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(e) 


(sinx  +  xcosx)(l  +  sinx)  —  xsinxcosx 
(1  +  sinx)2 


(f) 


3  /  3  \~1/2 

4 


(g)  W+*+l)-W(4f  +  2x)-**x  +  y)t 

(h)  3  sin2  x  cos  x  —  3x2cos(x3) 

(i)  4sec4xtanx  +  4tan3xsec2x 

(k)  (2x  +  2sec2xtanx)  sec2(sin(x2  +  sec2x))cos(x2  +  sec2 x) 

n  cos  - 

(l)  - 2 - 

x2(2  +  suiy)2 


4.9.4  (a)  3e3x-e-x 

(b)  2e2x  cos  3x  —  3e2x  sin  3x 


(c)  (1  +  ex)  sec2(x  +  ex) 

(d)  2ex/(ex  +  2)2 

cosx  cos(2  +  lnx) 

2  +  sinx  x 

(f)  exK  ■  nxn~ 1  +  nex%e~x  +  k e'  in n  ■  ex 

(g)  log((/?  +  (log^)x0og«^-' 

(h)  (x2+l )-5+l  (/3x2ln(x2  +  l)  +  2t(f+|1> 

V  X-  +  1 


(i) 


(x2  +  eY)1/lnx  (2x  +  ex  x2  +  ex\ 
(lnx)2  \x2  +  ex  x  / 


x\/x2  +x+ 1  / 1  2x+l 

(2  +  sinx)4(3x  +  5)7  \x^2(x2+x+l) 


4cosx  21  \ 
2  + sinx  3x  +  5y 


4.9.5  (a)  -(2x  +  y)/(x  +  2_y) 

x—  (2x2  +  2y2  —  x)(4x—  1) 
4y(2x2  +  2y2  —x)—y 
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(c) 

sinx  +  2xsin;y 

x2  cosy  +  3y2 

(d) 

2x  +  ey  —  ex 

2  —  xey 

4.9.6 

(a)  sin-1x  +  x/\/T 

(b) 

cos-1x  +  sin-1x 

(cos_1Jc)2Vl  —x2 

(c)  a/(x2  +  a 2) 

(d)  tan-1.* 

5.1.2  1/(16^)  cm/s 

5.1.3  3/(1000;r)  meters/second 

5.1.4  1/4  m/s 

5.1.5  -6/25  m/s 

5.1.6  80tt  mi/min 

5.1.7  3^  ft/s 

5.1.8  20/ (3k)  cm/s 

5.1.9  13/20  ft/s 

5.1.10  5\/T0/2  m/s 

5.1.11  75/64  m/min 

5.1.12  tip:  6  ft/s,  length:  5/2  ft/s 

5.1.13  tip:  20/11  m/s,  length:  9/11  m/s 

5.1.14  380/ 150^  69.4  mph 

5.1.15  500/ a/3  —  200  «  88.7  km/hr 

5.1.16  4000/49  m/s 


5.2.1  min  atx—  1/2 
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5.2.2  min  at  x  —  —  1,  max  at  x  =  1 

5.2.3  max  atx—2,  min  at x  =  4 

5.2.4  min  atx  —  ±1,  max  at  x  =  0. 

5.2.5  min  at  x  —  1 

5.2.6  none 

5.2.7  none 

5.2.8  min  atx  =  7;r/124-  kn,  max  aix  =  —n/\2  +  kn,  for  integer  k. 

5.2.9  local  min  at  x  =  49 

5.2.12  one 

5.2.16  Absolute  maximum  (3,7);  Absolute  minimum  (0, 1). 

5.2.17  Absolute  maximum  (3,7);  Absolute  minimum  (0, 1). 

5.2.18  Absolute  minimum  (n/2, 1);  No  absolute  maximum. 

5.2.19  Absolute  minimum  (1,0);  Absolute  maximum  (e1/2,  ^). 

5.2.20  Absolute  minimum  (1,0);  Absolute  maximum  (e1/2,  ^). 

5.2.21  Absolute  minimum  (0,0);  Absolute  maximum  (2,2e1/8). 

5.2.22  Absolute  minimum  (1/2,  =^~ ) ;  Absolute  maximum  (2,2  —  tan 1  (4) ) . 

5.2.23  Absolute  maximum  (1, 1/2);  Absolute  minimum  (— 1,  — 1/2). 

5.3.1  c  =  1/2 

5.3.2  c=\T\ 8-2 

5.3.6  x3/3  +  47x2/2  —  5x4- A' 

5.3.7  arctanx  +  A: 

5.3.8  x4/4  — lnx  +  A' 


5.3.9  —  cos(2x)/2  +  & 
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5.4.1  L(x)  =  x,  /(O.l)  ~  L(O.l)  =0.1 

5.4.2  Choose  f(x)  —  x3  and  a  —  2,  the  closest  integer  to  1.9.  The  linearization  of  /  at  a  is  L(x)  —  12(x  — 
2) +  8,  and  (1.9)3  = /(1.9)  «L(1.9)  =  12(1.9  -  2)  +  8  =  6.8. 

5.4.4  Choose  a  =  7  since  /(7)  =  ^7  +  1  =  \/8  =  2  is  an  integer  close  to  \/9.  The  linearization  of  /  at 
a  =  7  is  L(x)  -  l/n{x  -  7)  +  2.  Then  /(8)  =  J/8+l  =  y/9  «  L(8)  =  1/12(8  -  7)  +  2  =  2.083.  We  are 
over-estimating  \/9  since  L(x)  >  fix)  for  all  x  around  a  =  7. 


5.4.5  Ay  =  65/16,  r/y  =  2 

5.4.6  Ay  =  y/n/10-  1,  cly  =  0.05 

5.4.7  Ay  =  sin(7r/50),  dy  =  tt/50 

5.4.8  </V  -  8k/25 

5.4.9  r5(*)  =*-£  +  £ 

(a)  sin(0.1)«r5(0.1)«  0.10016675 

(b)  sin(O.l)  =0.0998334...  using  a  calculator.  Our  approximation  is  accurate  to  0.10016675 —  0.0998334. . 
0.0003. 

5.4.11  T$(x)  =  x  +  x2  +x3.  The  point  x  =  5  is  not  close  to  x  —  0,  and  /  is  not  continuous  at  x  =  1. 

5.4.12  (a)  /W (x)  =  (~1)("~j,)(B~1)! 

n  /  (~ 1)('~1)(t— !)!  X  n  / 

(b)  Tn{x)  —  ln(l)  +  ^ ^ - (x  —  1)'  =  ( - 7: - -  ]  (x—  1)'  since  ln(l)  =  0  and 

i=  1  l-  i=  1  V  z-  J 

ln=  1. 

5.4.13  Notice  that  /(— 2)  =  19,  /(0)  =  —11,  and  /(5)  =  19  and  /  is  a  continuous  function.  By  the 
Intermediate  Value  Theorem  there  exists  a  root  in  [—2,0]  and  [0,5].  Choose  xo  =  0,  then  X4  ~  —0.93242. 
Choose  xo  =  5,  then  X4  ~  3.93242. 

5.4.14  (a)  X4«  1.00022... 

(b)  x  =  1  is  the  root  of  /.  Our  approximation  in  part  (a)  was  correct  to  3  decimal  places. 

(c)  x\  =  1.  The  root  is  found  in  one  iteration  of  Newton's  Method. 

5.4.15  cos(7t/2)  =  0,  so  x\  is  undefined. 


5.5.1  0 


Selected  Exercise  Answers  595 


5.5.2  °o 

5.5.3  0 

5.5.4  0 

5.5.5  1/6 

5.5.6  1/16 

5.5.7  3/2 

5.5.8  -1/4 

5.5.9  -3 

5.5.10  1/2 

5.5.11  0 

5.5.12  -1 

5.5.13  -1/2 

5.5.14  5 

5.5.15  1 

5.5.16  1 

5.5.17  2 

5.5.18  1 

5.5.19  0 

5.5.20  1/2 

5.5.21  2 

5.5.22  0 

5.5.23  1/2 

5.5.24  -1/2 
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5.5.25  2 

5.5.26  0 

5.5.27  °o 

5.5.28  0 

5.5.29  5 

5.5.30  -1/2 

5.6.1  min  atx  —  1/2 

5.6.2  min  at  x  =  —  1,  max  at  x  =  1 

5.6.3  max  at  x  =  2,  min  at  x  =  4 

5.6.4  min  atx  —  ±1,  max  at  x  =  0. 

5.6.5  min  at  x  =  1 

5.6.6  none 

5.6.7  none 

5.6.8  min  atx  =  77r/12  +  kn,  max  atx=-7r/12  +  kn,  for  integer  k. 

5.6.9  none 

5.6.10  max  at  x  =  0,  min  at  x  =  ±  1 1 

5.6.11  min  atx  —  —3/2,  neither  at  x  =  0 

5.6.12  min  at  nn,  max  at  n/2  +  nn 

5.6.13  min  at  2nn,  max  at  (2n  +  1)k 

5.6.14  min  at  k/ 2  +  2 nn,  max  at  3tt/2  +  2nn 

5.6.17  min  atx  =  1/2 

5.6.18  min  at  x  =  —  1,  max  at  x  =  1 


5.6.19  max  at  x  =  2,  min  at  x  =  4 
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5.6.20  min  at  x  —  ±1,  max  at  x  —  0. 

5.6.21  min  at  x  =  1 

5.6.22  none 

5.6.23  none 

5.6.24  min  at  x  —  1k/\2  +  nn ,  max  at  x  =  —n/ 12  +  nn,  for  integer  n. 

5.6.25  max  at  x  =  63/64 

5.6.26  max  atx  —  1 

5.6.27  max  at  —  5~1//4,  min  at  5~1//4 

5.6.28  none 

5.6.29  max  at  —1,  min  at  1 

5.6.30  min  at  2-1/3 

5.6.31  none 

5.6.32  min  at  nn 

5.6.33  max  at  nn,  min  at  n/2  +  nn 

5.6.34  max  at  n/2  +  Inn,  min  at  3tt/2  +  2nn 

5.6.35  concave  up  everywhere 

5.6.36  concave  up  when  x  <  0,  concave  down  when  x  >  0 

5.6.37  concave  down  when  x  <  3,  concave  up  when  x  >  3 

5.6.38  concave  up  when  x  <  —  1  /  V3  or  x  >  1  /  \/3,  concave  down  when  —  1  /  V3  <  x  <  1  /  V3 

5.6.39  concave  up  when  x<0orx>2/3,  concave  down  when  0  <  x  <  2/3 

5.6.40  concave  up  when  x  <  0,  concave  down  when  x  >  0 

5.6.41  concave  up  when  x  <  —  1  or  x  >  1,  concave  down  when  —1  <x<0or0<x<  1 

5.6.42  concave  down  on  ((8 n  —  1)tt/4,  (8n  +  3)tt/4),  concave  up  on  ((8n  +  3)n/4,  (Sn  +  7)tt/4),  for  in¬ 
teger  n 


598  Selected  Exercise  Answers 

5.6.43  concave  down  everywhere 

5.6.44  concave  up  on  (—00,(21  —  V497)/4)  and  (21  +  V497)/4,oo) 

5.6.45  concave  up  on  (0,°°) 

5.6.46  concave  down  on  (2rm/3,  (2 n  +  1)tt/3) 

5.6.47  concave  up  on  (0,°°) 

5.6.48  concave  up  on  (—  °°,  —  1)  and  (0,°°) 

5.6.49  concave  down  everywhere 

5.6.50  concave  up  everywhere 

5.6.51  concave  up  on  (n/4  +  nn,3n/4  +  nn) 

5.6.52  inflection  points  at  nn,  ±arcsin(A/2/3)  +nn 

5.6.53  up/incr:  (3,°o),  up/decr:  (— °o,0),  (2,3),  down/decr:  (0,2) 

5.7.1  25  x  25 

5.7.2  P/4  x  P/4 

5.7.3  w  =  l  =  2  •  52/3,  h  =  52/3,  h/w  =  1/2 

5.7.4  //TOO  x  //TOO  x  2V/I00,  h/s  =  2 

5.7.5  w  =  l  =  21/3V1/3?  h  =  y1/3/22/3,  h/w  =  1/2 

5.7.6  1250  square  feet 

5.7.7  I2/ 8  square  feet 

5.7.8  $5000 

5.7.9  100 

5.7.10  r 2 

5.7.11  h/r=  2 


5.7.12  h/r  -  2 
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5.7.13  r  —  5,  h  =  40/tt,  h/r  =  %/iz 

5.7.14  8/k 


5.7.15  4/27 

5.7.16  (a)  2,  (b)  7/2 

=  n,n  73  73  1  1 

5-7J7  ^X^  +  2X4 


73 

12 


5.7.18  (a)  a/6,  (b)  (a  +  b  —  7^2  —  ob  +  b2) / 6 

5.7.19  1.5  meters  wide  by  1.25  meters  tall 

5.7.20  If  k  <  2/tt  the  ratio  is  (2  —  kn)/4;  ifk>2/n,  the  ratio  is  zero:  the  window  should  be  semicircular 
with  no  rectangular  part. 


5.7.21  a/b 

5.7.22  1/73  «  58% 

5.7.23  18  x  18x36 


5.7.24  r  —  5 / (2tt)  1//3  ~  2.7  cm, 
h  =  5  •  25/3/7T1/3  =  4r  «  10.8  cm 


5.7.25  h 


750  /  27T2  \  1/3  /  7502\  1/6 

~jr\75&)  ’  r  —  \  2k^  J 


5.7.26  h/r  =  72 

5.7.27  1/2 

5.7.28  $7000 
6.1.1  10 
6.1.2  35/3 


6.1.3  x2 


6.1.4  2x2 

6.1.5  2x2  —  8 
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6.1.6  lb2 -la2 

6.1.7  4  rectangles:  41/4=  10.25,  8  rectangles:  183/16=  11.4375 

6.1.8  23/4 

6.2.1  87/2 

6.2.2  2 

6.2.3  ln(10) 

6.2.4  e5  - 1 

6.2.5  34/4 

6.2.6  26/ 6  —  1/6 

6.2.7  x2  —  3x 

6.2.8  2x(x4  —  3x2) 

6.2.9  ev2 

6.2.10  2x/ 

6.2.11  tan(x2) 

6.2.12  2xtan(x4)  —  10tan(100.r2) 

6.2.13  31,  14 

6.2.14  5 

6.2.15  (a)  2/3 

(b)  24/5 

6.3.1  (16/3  )x3/2  +  C 

6.3.2  t2  4”  1 4”  C 

6.3.3  S\/x-\ -C 


6.3.4  -2/z  +  C 
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6.3.5  llns  +  C 

6.3.6  (5.r+l)3/15  +  C 

6.3.7  (;c-6)3/3  +  C 

6.3.8  2x5/2/5  +  C 

6.3.9  -4/^  +  C 

6.3.10  4t  —  t2  +  C,t<2-,t2  —  4t  +  S  +  C,t>2 

7.1.1  -(l-t)10/10  +  C 

7.1.2  x5/5 +  2.r3/3 +.r  +  C 

7.1.3  (x2  +  l)101/202  +  C 

7.1.4  —3(1  —  5t)2/3/10  +  C 

7.1.5  (sin4.r)/4  +  C 

7.1.6  -(100-x2)3/2/3  +  C 

7.1.7  —2\J  1  — x3/3  +  C 

7.1.8  sin(sin7rf)/;r  +  C 

7.1.9  l/(2cos2x)  =  (1/2)  sec2.r  +  C 

7.1.10  —  In  |  cos.r|  +C 

7.1.11  0 

7.1.12  tan2(x)/2  +  C 

7.1.13  1/4 

7.1.14  —  cos(tanx)+C 

7.1.15  1/10 

7.1.16  V3/4 

7.1.17  (27/8) (x2-7)8/9 
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7.1.18  — (37  +  l)/14 

7.1.19  0 

7.1.20  /(x)2/2 


7.2.1  x/2  —  sin(2x)/4  +  C 

7.2.2  —  cosx+ (cos3x)/3  +  C 

7.2.3  3x/8  —  (sin2x)/4  +  (sin4x)/32  +  C 

7.2.4  (cos5x)/5  —  (cos3x)/3 +C 

7.2.5  sinx— (sin3x)/3 +C 

7.2.6  (sin3x)/3  —  (sin5x)/5  +  C 

7.2.7  — 2(cosx)5/2/5  +  C 


7.2.8  tanx  —  cotx  +  C 

7.2.9  (sec3x)/3  — secx  +  C 

7.2.10  —  cosx+sinx  +  C 


3  1 

7.2.11  -  In  I  secx  +  tanxl  +  tanx  +  -  secxtanx  +  C 

2  2 

„  „  tan5(x2)  ^ 

7.2.12  — j^  +  C 


7.3.1  xV x2  —  1/2  —  In  |x+  V x2  —  l|/2  +  C 

7.3.2  x\/9  +  4x2/2  +  (9/4)  In  1 2x  +  \/9  +  4x2 1  +  C 

7.3.3  —  (1  —  x2)3/2/3  +  C 

7.3.4  arcsin(x)/8  —  sin(4arcsinx)/32  +  C 


7.3.5  ln|x+  \J  1  +x2|  +  C 

7.3.6  (x+  1)  V*2  +  2x/2  —  In  |x+  1  +  \/ x2  +  2x\/2  +  C 


7.3.7  —  arctanx  —  l/x  +  C 
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7.3.8  2  arcsin(x/ 2)  —  xs/ 4  —  x2/2  +  C 

7.3.9  arcsin( y/x)  —  \ fx\J l—x  +  C 

7.3.10  (2x2  +  1) v7 4x2  —  1/24  +  C 

7.4.1  cosx  +  xsinx  +  C 

7.4.2  x2sinx  —  2sinx  +  2xcosx  +  C 

7.4.3  (x-1  )e*  +  C 

7.4.4  (1/2 )ex2+C 

7.4.5  (x/2)-sin(2x)/4  +  C  = 

(x/2)  —  (sinxcosx)/2  +  C 

7.4.6  xlnx  —  x  +  C 

7.4.7  (x2  arctanx  +  arctanx  —  x)/2  +  C 

7.4.8  —x3  cosx  +  3x2  sinx  +  6xcosx  —  6  sinx  +  C 

7.4.9  x3  sinx  +  3x2 cosx  —  6xsinx  —  6 cosx  +  C 

7.4.10  x2/4—  (cos2x)/4  —  (xsinxcosx)/2  +  C 

7.4.11  x/4  —  (xcos2x)/2  +  (cosx  sinx) /4  +  C 

7.4.12  xarctanf  x/r)  +  arctan  ( y/x)  —  y/x  +  C 

7.4.13  2  sin(y(r)  —  2^/xcos(y/x)  +C 

7.4.14  secxcscx  —  2  cotx  +  C 

7.5.1  -ln|x-2|/4  +  ln|x  +  2|/4  +  C 

7.5.2  —  x3/3  —  4x  —  41n  |x  —  2|  + 

4 In  |x  +  2|  -f  C 

7.5.3  -l/(x  +  5)+C 

7.5.4  — x  — lnlx  — 21 +ln|x  +  2|  +  C 


7.5.5  — 4x  +  x3/3  +  8arctan(x/2) +C 
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7.5.6  (l/2)arctan(x/2±5/2)  ±C 

7.5.7  x2/2  — 21n(4±v2)  ±C 

7.5.8  (l/4)ln|x±3|-(l/4)ln|x±7|±C 

7.5.9  (l/5)ln|2x  — 3|  —  (l/5)ln|l±v|±C 

7.5.10  (l/3)ln|jc|  -  (l/3)ln|*±3|  ±C 

7.6.1  T,S:  4±0 

7.6.2  T:  9.28125  ±0.281125;  S:  9±0 

7.6.3  T:  60.75  ±1;  S:  60  ±0 

7.6.4  T:  1. 1 167  ±  0.0833;  S:  1.1000  ±0.0167 

7.6.5  T:  0.3235  ±0.0026;  S:  0.3217  ±0.000065 

7.6.6  T:  0.6478  ±0.0052;  S:  0.6438  ±0.000033 

7.6.7  T:  2.8833  ±0.0834;  S:  2.9000  ±0.0167 

7.6.8  T:  1.1170±0.0077;  S:  1.1114±0.0002 

7.6.9  T:  1.097 ±0.0147;  S:  1.089 ±0.0003 

7.6.10  T:  3.63  ±0.087;  S:  3.62  ±0.032 

7.7.1  Converges  to  1 . 

7.7.2  Diverges. 

7.7.3  1/3 

7.7.4  Divergent. 

7.7.7  (a)  n/2 

(b)  divergent  (to  °°) 

(c)  1 

(d)  divergent  (to  °°) 

(e)  |(43/5) 
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7.7.9  0  <  p  <  1 

(t  +  4)4 


7.8.1 


+  C 


(f2  —  9)5/2 

7.8.2  - -2 — +C 


7.8.3  (er  +  16>2+C 


2  , 

7.8.4  cos  t—  -  cos  t  +  C 


7.8.5 


tan2/ 


■fC 


7.8.6  ln|r  +  /  +  3|+C 

7.8.7  ^ln|l-4/r|+C 
8 


7.8.8  —  tan(arcsin(/75))  +C  — - . ? 

25  1  "  25V25^? 


+  C 


7.8.9  ^\/sin  3/  +  C 

7.8.10  /tan/  +  ln|cos/|  +C 

7.8.11  2Vef  +  l+C 

7.8.12  4  + ^2  + ^+C 
7.8.!3  !^M-4EA+c 


7.8.14 


7.8.15 


7.8.16 


3  3 

-1 


sin  arc  tan/ 
-1 


■fC  —  —  \J  1  t2/ 1  -\-  C 


+c 


2(1  +  tan/)2 
(z2  + 1)5/2  (r2  + 1)3/2 


+c 


5 


3 
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7.8.17 


er  sin  /  —  e1  cost 


7.8.18  Ai±iZ£+c 
6 

7'8'19  3(2-<2)3/2_(2-<2)l/2+C 

7.8.20  Hsin(arc,an(2,/3))|  +  c  =  (|n(4(2)  _  |n(9  +  4,2))/ 18  +  C 


_  _  (arctan(2t))2  ^ 

7.8.21  - - /  ”  +C 

4 


7,8,22  31n'f  +  3l+il^4+C 
4  4 


7.8.23 


cos7 1  cos5 1 


7.8.24  —  +  C 


7.8.25  — +C 

lnt 


r(ln/)2  t2lnt  t2 
7.8.26^-  —  +  -+C 


7.8.27  (t3-3t2  +  6t-6)er+C 


7.8.28  ln(2t+l-v/5)  +  5  ^ ln(2f  +  1  +  v7^)  +C 

8.1.1  It  rises  until  t  —  100/49,  then  falls.  The  position  of  the  object  at  timet  is  s(t)  =  —4.9t2  +  20t  +  k.  The 
net  distance  traveled  is  —45/2,  that  is,  it  ends  up  45/2  meters  below  where  it  started.  The  total  distance 
traveled  is  6205/98  meters. 


8.1.2  /  sintdt  =  0 
Jo 


8.1.3  net:  2k,  total:  27r/3  +  4\/3 


8.1.4  8 


8.1.5  17/3 
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8.1.6  A  =  18,  B  =  44/3,  C  =  10/3 

8.2.1  8V2/15 

8.2.2  1/12 

8.2.3  9/2 

8.2.4  4/3 

8.2.5  2/3-2/tt 

8.2.6  3/tt  —  3 v/3/(2tt)  —  1/8 

8.2.7  1/3 

8.2.8  lOx/5/3  —  6 

8.2.9  500/3 

8.2.10  2 

8.2.11  1/5 

8.2.12  1/6 

8.3.5  8n/3 

8.3.6  tt/30 

8.3.7  n{n/2  —  1) 

8.3.8  (a)  114tt/5  (c)  20tt 

(b)  lAn/5  (d)  An 

8.3.9  \6n,  24^: 

8.3.11  nh2(3r  —  h)/3 

8.3.13  In 

8.4.1  2 /n;  2 /n;  0 


8.4.2  4/3 
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8.4.3  1/A 

8.4.4  tt/4 

8.4.5  -1/3,  1 

8.4.6  -4V1224  ft/s;  -8V1224  ft/s 

8.5.1  «  5,305,028,516  N-m 

8.5.2  «  4,457,854,041  N-m 

8.5.3  367,500tt  N-m 

8.5.4  490007T+ 196000/3  N-m 

8.5.5  24507T  N-m 

8.5.6  0.05  N-m 

8.5.7  6/5  N-m 

8.5.8  3920  N-m 

8.5.9  23520  N-m 

8.5.10  12740  N-m 

8.6.1  15/2 

8.6.2  5 

8.6.3  16/5 

8.6.5  x  =  45/28,  y  =  93/70 

8.6.6  x  =  0,  y  =  4/ (3  k) 

8.6.7  x  =  1/2,  y  =  2/5 

8.6.8  Jc  =  0,  y  =  8/5 

8.6.9  x  =  4/7,  j  =  2/5 

8.6.10  x  =  y  =  1/5 
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8.6.11  x  =  0,  y  =  28/ (9tt) 

8.6.12  x  =  y  =  28 /(9k) 


8.6.13  x  =  0,  y  =  244/ (21k)  «  2.88 

8.7.1  (22a/22  —  8)  /  27 

8.7.2  ln(2)  4-3/8 

8.7.3  a  +  a3/3 

8.7.4  ln((v^+l)/V^) 

8.7.6  3/4 

8.7.7  rj  3.82 

8.7.8  «  1.01 


8.7.9  \/l+e2-V2+^ln  ^ 


V 1  +  e1  —  1 
x/ 1  +  e2  +  1 


8.8.1  SkV3- 


16^\/2 


8.8.3 


730^V/730  10^\/l0 


27 


27 


1  o  7T  2  7F 

8.8.4  7r  +  27T£?  +  -7T7?2 - T - 

4  4e-  e 


8.8.6  8k- 


8.8.7  2k  + 


8  K~ 

3x/3 


8.8.8  £7  >  b:  2Kb~+ 


Incrb 


:  arcsin 


(\/  a2  —  b2/a). 


V  a2  —  b2 
a  <b:  2nb2-\- 

Incrb  .  /  6  \/b2  —  a2 


Vb2  —  cv 


In  -  + 


£7  £7 


^ln(3  +  2v/2) 


9.1.1  1 
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9.1.3  0 

9.1.4  1 

9.1.5  1 

9.1.6  0 

9.2.1  lim  n2/ (2n2  + 1) = 1/2 

n-^oo 

9.2.2  lim  5/(21/"  +  14)  =  1/3 

n— >oo 

OO  j 

9.2.3  £“=i  \  diverges,  so  ^  3-  diverges 

n= 1  n 

9.2.4  -3/2 

9.2.5  11 

9.2.6  20 

9.2.7  3/4 

9.2.8  3/2 

9.2.9  3/10 

9.3.1  diverges 

9.3.2  diverges 

9.3.3  converges 

9.3.4  converges 

9.3.5  converges 

9.3.6  converges 

9.3.7  diverges 

9.3.8  converges 


9.3.9  N  =  5 


9.3.10  N  =  10 

9.3.11  N  —  1687 

9.3.12  any  integer  greater  than  e200 

9.4.1  converges 

9.4.2  converges 

9.4.3  diverges 

9.4.4  converges 

9.4.5  0.90 

9.4.6  0.95 

9.5.1  converges 

9.5.2  converges 

9.5.3  converges 

9.5.4  diverges 

9.5.5  diverges 

9.5.6  diverges 

9.5.7  converges 

9.5.8  diverges 

9.5.9  converges 

9.5.10  diverges 

9.6.1  converges  absolutely 

9.6.2  diverges 

9.6.3  converges  conditionally 
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9.6.4  converges  absolutely 
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9.6.5  converges  conditionally 

9.6.6  converges  absolutely 

9.6.7  diverges 

9.6.8  converges  conditionally 

9.7.5  (a)  converges 

(b)  converges 

(c)  converges 

(d)  diverges 

9.8.1  (a)  R=  1,1=  (-1,1) 

(b)  R  =  I  —  (—00,00) 

(c)  R  =  e,  I  =  (2  —  e,  2  +  e) 

(d)  R  =  0,  converges  only  when  x  —  2 

(e)  R=  1,7=  [-6,-4] 

9.8.2  R  =  e 


9.9.1  the  alternating  harmonic  series 


9.9.2  £(n+  l)x” 

n= 0 


9.9.3  £(n+l)(n  +  2)jc" 

n= 0 

9.9.4  £(”  +  1)('1  +  2)y  j;=1 

n  2 

12=0 


00  1 

9.9.5  C+V  - - ^ - - 

„=0  («  +  !)(» +  2) 


*"+2 


9.10.1  (a)  £(-l)V7(2n)!,l?  =  ' 


n=0 


(b)  Y,x'l/nl,R  = 

n= 0 
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<0 

n= 0  J 

<d) 

n—  1 


00  /  ^  \  72 

(e)  ln(2)+^(-l)”-iy^J-,R  =  2 
1 


ill” 


(f)  £(-!)>  +  l)(x-l)”,R=l 


n=0 


(g>1+E  1'3'5„;12"-1)^1+e^^^.^1 


/2=  1 
A 


n\2n 

(h)  x  ±  xJ  /  3 

CXD 

(i)  £(-l)"x4"+1/(2,.)! 


22”  !(n— l)!n! 


n=0 


0)  £(-i)v+7«! 

n=0 


x2  x4  X6 


9.11.1  1-  —  +  — 


42 


2  '  24  720 H  h  12! 


9.11.2  1000;  8 


x ^  2x5 

9.11.3  x±  —  ±  -y^-,  error  ±1.27. 


10.1.2  y  =  arctant  +  C 


fn+ 1 

10.1.3  y  = - ±  1 

n  +  1 

10.1.4  y  =  tint  —  t  +  C 

10.1.5  y  =  nK,  for  any  integer  n. 

10.1.6  none 

10.1.7  y=±Vt2  +  C 

10.1.8  y  =  ± l,y  =  (1  ±Ae2f)/(l  - Ae2t ) 

10.1.9  y4/4-5y  =  r/2±C 
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10.1.10  y  =  (2t/3)3/2 

10.1.11  y  =  M  +  Ae~kt 


10.1.12  1Qln(15/2)  -2.52  minutes 
In  5 


10.1.13  y 


M 


1  +Ae 


-Mkt 


10.1.14  y  =  2c3f/2 


10.1.15  t  = 


In  2 
k 


10.1.16  600c  61n2/5  261  mg;  ~  4 1  days 

&  In  2 


10.1.17  ioOe~2001n2/191  «  48  mg;  573^1>n5°  «  32339  years 


10.1.18  y  —  y0crln2 

10.1.19  500c_51n2/4  ~  210  g 

10.2.1  y  =  Ae~5t 

10.2.2  y  =  Ae2t 

10.2.3  y  =  Ac-"f 

10.2.4  y  =  Ae~f3/ 3 

10.2.5  y  =  Ae~f 


10.2.6  y  =  -2e3t~3 


10.2.7  y  =  el+cost 


10.2.8  y  =  e2e~e' 


10.2.9  y  =  0 

10.2.10  y  =  0 

10.2.11  y  -  4t2 
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10.2.12  y  =  - 2e(-l/t)~l 

10.2.13  y  =  el~r2 


10.2.14  y  =  0 

10.2.15  k  =  ln5,  y  —  100e_/ln5 

10.2.16  k  =  —12/13,  y  =  exp(  — 13/1/13) 

10.2.17  j  =  10Vln(3/2) 

10.2.18  >’  =  10e"/ln(2)/6 

10.3.1  y  =  Ae~4t  +  2 

10.3.2  y  =  Ae2t  —  3 

10.3.3  y  =  Ae-(1/2)f2+5 

10.3.4  y  =  Ae~e'  -  2 

10.3.5  _y  =  Aef  —  t2  —  It  —  2 

10.3.6  y  =  A£>-f/2+t-2 

10.3.7  y  =  At2  —  — 

7  3r 

10.3.8  y  —  -  +  -\/t 

10.3.9  y  —  A  cost  +  sint 

10.3.10  y  = - - - +  1 - - - 

sect  +  tant  sect  +  tant 

10.4.1  y(l)  «  1.355 

10.4.2  y(l)  «  40.31 

10.4.3  y(l)  «  1.05 


10.4.4  y(l)  «2.30 


10.5.1 


<°+1cw  , 

2(0  2co 


e 


at 
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10.5.2  2  cos(3/)  +  5sin(3t) 

10.5.3  -(l/4)e~5f  +  (5/4)g-f 

10.5.4  —2e~3t  +  2eAt 

10.5.5  5e’6r  +  20 te~6t 

10.5.6  {I6t-3)e4t 

10.5.7  —  2cos(v/5t)  +  v/5sin(v/5t) 

10.5.8  —  V2.  cost +  V2.  sin  t 

10.5.9  e~6r  (4cost  +  24sint) 

10.5.10  2e~3t  sin(3t) 

10.5.11  2cos(2t  —  7t/6) 

10.5.12  5\/2cos(10r  —  ft/4) 

10.5.13  V2e~2t  cos(3t  —  K  /  A) 

10.5.14  5e4?cos(3t +  arcsin(4/5)) 

10.5.15  (2cos(5t)  +  sin(5t))e~2f 

10.5.16  — (l/2)e_2/ sin(2t) 

10.6.1  Ae5t  +  BteSt  +  (6/169)  cos t  -  (5/338)  sinf 

10.6.2  Ae~^  +  Bte~^  +  5 

10.6.3  Acos(4t)  +5sin(4t)  +  (l/2)t2  +  (3/16)f - 5/16 

10.6.4  Acos(v/2t)  +  Bsin(V2t)  —  (cos(5f)  +sin(5f))/23 

10.6.5  /(A  cost +  .6  sin?)  Jre2t / 2 

10.6.6  Ae^  +  Be~^  +  2  - 1/3  -  e~* / 5 

10.6.7  Ae~3t  +  Be2t  -{\/5)te~3t 

10.6.8  Ae1  +Be3t  +  (\/2)te31 
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10.6.9  Acos(4t)  +  Z?sin(4t)  +  (l/8)tsin(4t) 

10.6.10  Acos(3t)  +  Z?sin(3t)  —  (l/2)tcos(3t) 

10.6.11  Ae~6t  +  Bte~ 6t  +  3  t2e-6t 

10.6.12  Ae4t  +Bte4t  -t2e4t 

10.6.13  Ae~t+Be~5t  +  (4/5 ) 

10.6.14  Ae4t  +  Be~3t  +  (1/144)  -  (t/12) 

10.6.15  Acos(v/5r)  +  B sin( V5t)  +  8sin(2t) 

10.6.16  Ae2t  +  Be~2t  +  te2t 

10.6.17  4et  +  e~t-3t-5 

10.6.18  —(4/27)  sin(3t)  +  (4/9)t 

10.6.19  e-6t(2cost  +  20smt)  +  2e~4t 

(  23  59^  \  23  11 

1#-6'2#  (“325  C°S(3,)  +  975  Sin(3,) )  +  325  C°S'  “  325  1 sinf 

10.6.21  e~2f(Asin(5t)  +  5cos(5t))  +  8sin(2t)  +  25cos(2t) 

10.6.22  e~2t (A sin(2t)  +  Z?cos(2t))  + (14/195)  sin t  —  (8/195)  cost 

10.7.1  Asin(t)+.ficos(t)  — 
costln|  sect +  tant| 

1 

10.7.2  Asin(t)  +Z?cos(t)  +  -e~l 

10.7.3  Asin(2t)  +5cos(2t)  +cost  —  sintcostln|  sect  +  tant| 


1  9  1 

10.7.4  Asin(2t)  +Z?cos(2t)  +  -  sin(2t)  sin  (t)  +  -  sin(2t)ln|  cost| 


^  cos(2t)  +  ^  sin(2t)  cos(2t) 


10.7.5  Ae1’  +  Be~lr  +  -(t-A  +  AA  ‘A 


15 


10.7.6  Ae 1  sin  t  +  Be 1  cos  t  —  e1  cos  t  In  I  sec  t  +  tan  1 1 


10.7.7  Ae1  sin t  +  Be1  cost  —  ^cost(cos3t  +  3sin3t  —  2cost  —  sint)  +  ^  sint(sin3t  —  3cos3t  —  2sint  + 
cost)  =  cos(2t)  -  —  sin(2f) 


11.1.2 
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(a)  0  =  arctan(3) 

(d)  r  —  a/5 

(g)  r 

(b)  r  —  — 4csc0 

(e)  r2  =  sin  0  sec3  0 

(h)  r 

(c)  r  —  sec  0  esc2  0 

(f)  rsin0  =  sin(rcos0) 

(i)  0 

11.1.4  (a)  (x2  +  y2)2  =  4 x2y  -  (x2  +y2)y  (c)  x2  +y2  =  x2y2 

(b)  (x2  +y2)3/2  =  y2  (d)  x4  +x2y2  =  y2 


2/(sin0  —  5cos0) 
2sec0 

r2c os2  0  —  rsin  0  +  1 


11.2.1  (a)  (0cos0 +  sin0)/(  — 0sin0+cos0),  (02  +  2)/(— 0sin0  +  cos0)3 

cos0  +  2sin0cos0  3(1  + sin  0) 

cos2  0  —  sin2  0  —  sin  0  (cos2  0  —  sin2  0  —  sin  0)3 

(c)  (sin2  0  —  cos2  0)/(2sin0cos  0),  —1/(4 sin3  0cos3  0) 

2  sin  0  cos  0  2 

cos2  0  — sin2  0  (cos2  0  —  sin2  0)3 

(e)  undefined 


2  sin  0  —  3  sin3  0  3  cos4  0  —  3  cos2  0  +  2 
^  ^  3cos30  — 2cos0’  2cos3  0(3cos2  0  —  2)3 

11.3.1  (a)  1 

(b)  9tt/2 

(c)  a/3/3 


(d)  tt/12  +  x/3/ 16 

(e)  Tier  j 4 

(f)  4 1  tt/2 


11.3.2  2  —  n/2 


11.3.3  tt/12 

11.3.4  3tt/16 

11.3.5  n/4-3V3/8 

11.3.6  tt/2  +  3x/3/8 

11.3.7  1 

11.3.8  3/2  —  tt/4 

11.3.9  K/3  +  V3/2 

11.3.10  K/3-V3/4 
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11.3.11  4tt3/3 

11.3.12  K1 


11.3.13  5tt/24  —  \/3/4 

11.3.14  77T/12 -V3 


11.3.15  4;r  —  \/l5/2  — 7arccos(l/4) 

11.3.16  3  K3 


11.4.6  ^  =  r-7"W,r=i_£2!W 
2  2 


11.5.1  There  is  a  horizontal  tangent  at  all  multiples  of  n. 


11.5.2  9^/4 

rlK  1 

11.5.3  /  -v7^  —  4cost  dt 

Jo  2 

12.1.6  3,  V26,  V29 

12.1.7  v7^,  2y/U,  3vT4. 

12.1.8  (x  —  l)2  +  (y  —  l)2  +  (z  —  l)2  =  4. 

12.1.9  (x  — 2)2  +  (;y+  l)2  +  (z  —  3)2  =  25. 

12.1.11  (x  —  2)2  +  (y  —  l)2  +  (z+  l)2  =  16,  (;y-l)2  +  (z+l)2  =  12 

12.2.6  v7!^),  (0,-2),  (2,8)  2,  2VT7,  (-2,-6) 

12.2.7  (0,4,0),  (2,0,6)  4,  2^l0,  (-2, -4, -6) 

12.2.8  V2 ,  (0,-2, 3),  (2,2, -1)  y/l3,  3,  (-2,0, -2) 

12.2.9  \/3,  (1,  — 1,4),  (1,-1, -2)  3-\/2,  V6,  (-2, 2, -2) 

12.2.10  \/l4,  (2,1,0),  (4,3,2)  v7^,  v7^,  (-6, -4, -2) 

12.2.11  (-3, -3, -11),  (— 3/\/l39,— 3/\/l39,  — ll/\/l39)  (-12/VT39, -12/VT39,  -44/\/l39) 


12.2.12  (0,0,0) 
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12.2.13  0;  (— ry/3/2,r/2);  (0,  —  12r);  where  r  is  the  radius  of  the  clock 

12.3.1  3 

12.3.2  0 

12.3.3  2 

12.3.4  -6 

12.3.5  42 

12.3.6  V6/V7,  ~  0.39 

12.3.7  - 1 1  \/l4 a/29/406,  «  2. 15 

12.3.8  0,  n/2 

12.3.9  1/2, 7T/3 

12.3.10  —I/a/3,  «  2.19 

12.3.11  arccos(  1  / a/3)  ~  0.96 

12.3.12  a/5,  (1,2,0). 

12.3.13  3a/14/7,  (9/7, 6/7, 3/7). 

12.3.14  (0,5),  (5 a/3,0) 

12.3.15  (0,  15a/2/2),(15a/2/2,0) 

12.3.16  Any  vector  of  the  form  (a,  — 7a/2,  —2a) 

12.3.17  ( 1/ a/3,  — 1/a/3,  1/ a/3) 

12.3.18  No. 

12.3.19  Yes. 

12.4.1  (1,-2, 1) 

12.4.2  (4, -6, -2) 


12.4.3  (-7, 13, -9) 
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12.4.4  (0,-l,0) 

12.4.5  3 

12.4.6  21  x/2/2 

12.4.7  1 

12.5.1  (x  — 6)  +  (y  — 2)  +  (z  —  1)  =  0 

12.5.2  4(x  +  1)  +5(y  —  2)  —  (z  +  3)  =  0 

12.5.3  (x  —  1)  —  (y  —  2)  =  0 

12.5.4  — 2(x—  1)  +  3y  — 2z  =  0 

12.5.5  4(x—  1)  —  6y  =  0 

12.5.6  x  +  3y  =  0 

12.5.7  (1,0,3) +f(0, 2,1) 

12.5.8  (1,0,3) +f(l, 2,-1) 

12.5.9  f(l,l,-l) 

12.5.10  -2/5,  13/5 

12.5.12  neither 

12.5.13  parallel 

12.5.14  intersect  at  (3,6,5) 

12.5.15  same  line 

12.5.19  1/V3 

12.5.20  4/\/l4 

12.5.21  Vm/VU 

12.5.22  ^68/3 

12.5.23  a/42/7 
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12.5.24  V21/6 

12.6.1  (a)  (V2,tt/4,1),  (V3,Jr/4,aiccos(l/V3)) 

(b)  (7a/2,77t/4,5),  (^l23,7;r/4,arccos(5/\/l23) 

(c)  (1,1,1),  (a/2,  1,^/4) 

(d)  (0,0,— 7r),  (7T,0,7r) 

12.6.2  r2  +  z2  =  4 

12.6.3  rcos0  =  0 

12.6.4  r2  +  2z2  +  2z-5  =  0 

12.6.5  z  =  e  r2 

12.6.6  z  —  r 

12.6.7  sin  0  =  0 

12.6.8  1— pcos^) 

12.6.9  p  =  2sin0  sin<)>. 

12.6.10  p sin^  =  2 

12.6.11  cos  (j)  =  1/V2 

12.6.13  z  =  mr;  cotcj)  =  m  if  m  ^  0,  (j)  =  0  if  m  =  0 

12.6.14  A  sphere  with  radius  1/2,  center  at  (0, 1/2,0) 

12.6.15  0  <  0  <  tt/2,  0  <  (j)  <  tt/2,  p  >  0;  0  <  0  <  tt/2,  r  >  0,  z  >  0 

13.1.1  z  —  y2,z  —  x2,z  —  0,  lines  of  slope  1 

13.1.2  z=  |y|,  z  =  \x\,  z  =  2|jc|,  diamonds 

13.1.3  z  —  e  r  sin(y2),  z  =  e~x~  sin(jr),  z  —  e~2x~  sin(2x2),  circles 

13.1.4  z  —  —  sin(y),  z  —  sin(jc),  z  —  0,  lines  of  slope  1 


13.1.5  z  —  y4,  z  —  x 4,  z  —  0,  hyperbolas 
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13.1.6  (a)  {(x,y)  |  |jc|  <  3  and  \y\  >  2} 

(b)  {(x,y)  |  1  <  x2  +y2  <  3} 

(c)  {{x,y)\x2  +  4y2<\6} 

13.2.1  No  limit;  use  x  =  0  and  v  =  0. 

13.2.2  No  limit;  use  x  =  0  and  .r  =  y. 

13.2.3  No  limit;  use  x  =  0  and  .r  =  y. 

13.2.4  Limit  is  zero. 

13.2.5  Limit  is  1. 

13.2.6  Limit  is  zero. 

13.2.7  Limit  is  —1. 

13.2.8  Limit  is  zero. 

13.2.9  No  limit;  use  x  —  0  and  >’  =  0. 

13.2.10  Limit  is  zero. 

13.2.11  Limit  is— 1. 

13.2.12  Limit  is  zero. 

13.3.1  —  2xysin(x2y),  —x2  sin(x2y)  +  3y2 

13.3.2  (y2 -x2y)/{x2  +y)2,  x3/(x2  +  y)2 

13.3.3  2xexl+)’1\  2 ye^^ 

13.3.4  jln(xy)  +y,  xln(xy)  +x 

13.3.5  —x/  \J\  —  x2  —  y2,  —yj  \J  1  —  x2  —  y2 

13.3.6  tany,  xsec2y 

13.3.7  —  1/  ( x2y ),  —  1/  (xy2) 


13.3.8  z  —  —2(x—  1)  —  3(y— 1)  —  1 
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13.3.9  z  =  1 

13.3.10  z  =  6{x  —  3)  +  3(y  —  1)  +  10 

13.3.11  z=  (jc-2)+4(y-l/2) 

13.3.12  r(f)  =  (2,1,4)  +t(2, 4,-1) 

13.3.16  height 

13.4.1  4xtcos(x2  +y2)  +  6yt2cos(x2  +y2) 

13.4.2  2xycost  +  2x2/ 

13.4.3  2xytcos(st)  +  2 xrs,  2xyscos(st )  +2x2t 

13.4.4  2xy2t  —  4 yx2s,  2 xy2s  +  4yx2t 

13.4.5  x/z,  3v/(2z) 

13.4.6  —2 x/z,  —y/z 

13.4.7  (a)  V'  =  (nR-  0.2  V) /P 

(b)  P'  =  (nR  +  0.6P)/2V 

(c)  r'  =  (3P-0.4V)/(n/?) 

13.5.1  9y/5/5 

13.5.2  v^cosS 

13.5.3  eV2(V3-l)/4 

13.5.4  v/3  +  5 

13.5.5  —  v/6(2  + v/3)/72 

13.5.6  -1/5,0 

13.5.7  4(x  —  2)  +  8(y  —  1)  =  0 

13.5.8  2(x  —  3)  +  3(y  —  2)  =  0 

13.5.9  (-1,-1  —  cosl,  — cosl),  —  \/2  +  2cosl  +  2 cos2  1 
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13.5.10  Any  direction  perpendicular  to  VT  =  (1, 1, 1),  for  example,  (—1, 1,0) 

13.5.11  2(x—  1)  —  6(y—  1)  +  6(z  — 3)  =  0 

13.5.12  6(x— l)  +  3(y  — 2)  +  2(z  — 3)  =0 

13.5.13  (2  +  4t,  — 3  —  12/,  — 1  —  8t) 

13.5.14  (4  +  St,2  +  At,  —2  —  36/ ) 

13.5.15  (4  +  8r,  2  +  20t,  6  —  12t) 

13.5.16  (0,1),  (4/5, -3/5) 

13.5.18  (a)  (4,9) 

(b)  (-81,2)  or  (81,-2) 

13.5.19  in  the  direction  of  (8, 1) 

13.5.20  Vg(  — 1,3)  =  (2,1) 

13.6.1  fxx  =  (2 x3y  -  6 xy3)/ {x2+y2)3,  fyy  =  (2 xy3  -  6 x3y)/{x2  +y2)3 

13.6.2  fx  =  3 x2y2,  fy  =  2x3y  +  5y4,  fxx  =  6 xy2,  fyy  =  2x3  +  20y3,  fxy  =  6x2y 


13.6.3  fx  =12 x2+y2,fy  =  2 xy, 

fxx  =  24x,  f^  =  2x,  fxy  =  2 y 

13.6.4  fx  =  siny,  fy  =  xcosy,  fxx  =  0,  fyy  =  -xsin y,  fxy  =  cosy 

13.6.5  fx  =  3cos(3x)cos(2y), 
fy  —  —  2sin(3x)sin(2y), 

fxy  =  — 6cos(3x)  sin(2y), 
fyy  =  — 4sin(3x)  cos(2y), 
fxx  =  —  9sin(3x)  cos(2y) 

13.6.6  fx  =  ex+r,  fy  =  2 yex+r, 

fxx  =  ex+y\ 

fyy  =  4y2ex+yl  +  2ex+y2, 
fXy  =  2yeK+y2 

11^7  f  _  3-y2  r  _  2-V3  ,  _ 9x4  _  6  y2 

h  2(x3  +  y4)  ’  fy  x3  +  y4  ’  hx  x3  +  y4  2(x3  +  y4)2  ’  hy  x3  +  y4 
*  _  —6x2y3 
}xy~  (*3+v4)2 


8y6 

(x3  +y4)2’ 
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13.6 

Zxx  ~ 

Zyy  = 

Zxy  - 


Z,x  —  .  £  i  2 y 

16z 

16z2+x2 

162z3  ’ 

4  z2+y2 


- y 

4  z  ’ 


16z3 


-*y 

64z3 


_  y+z  x+z 

y  zx  =  zy 


13.6 

Zxx  ~ 

Zxy  ~ 

13.7.1  minimum  at  (1,-1) 


x+y  ■  x  +  y 
o  y+z  7  =o  *  +  z 
(x+y)2  ” 

2  z 

(x+y)2 


(x+y)2. 


13.7.2  none 


13.7.3  none 

13.7.4  maximum  at  ( 1,  — 1/6) 

13.7.5  none 

13.7.6  minimum  at  (2,-1) 

13.7.7  /(2,2)  =  —2,  /(2,0)  =  4 

13.7.8  a  cube  1  / \fl  on  a  side 

13.7.9  65/3  x  65/3  x  130/3 

13.7.10  It  has  a  square  base,  and  is  one  and  one  half  times  as  tall  as  wide.  If  the  volume  is  V  the  dimensions 
are  ^2E/3  x  ^2V/3  x  {/9V/4. 

13.7.11  a/  100/3 

13.7. 12  |  axo  +  ^vo  +  czo  —  r/ 1  /  \/ c/2  +  &2  +  c2 

13.7.13  The  sides  and  bottom  should  all  be  2/3  meter,  and  the  sides  should  be  bent  up  at  angle  n/3. 

13.7.14  (3,4/3) 

13.7.16  \b\  if  b  <  1/2,  otherwise  y/b—\/4 
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13.7.17  \b\  if  b  <  1/2,  otherwise  y/b—l/4 
13.7.19  1024/  \/3 

13.8.1  a  cube,  Z/Yjl  x  ^/YJl  x  {/TJl 

13.8.2  65/3  ■  65/3  •  130/3  =  2  •  653/27 

13.8.3  It  has  a  square  base,  and  is  one  and  one  half  times  as  tall  as  wide.  If  the  volume  is  V  the  dimensions 
are  ^2E/3  x  0Vj3  x  ^/9V/4. 

13.8.4  |  ax o  +  byo  +  czo  ~  d  \  / \J a 2  +  b2  +  c2 

13.8.5  (0,0,1),  (0, 0,-1) 

13.8.6  ffiV  x  \/4V  x  ^/VJl 6 

13.8.7  Farthest:  (— 2  +  2\/2);  closest:  (2,0,0),  (0, —2,0) 

13.8.8  x  =  y  =  z  =  16 

13.8.9  (1,2,2) 

13.8.10  (^,0,0),  (-^,0,0) 

13.8.11  standard  $65,  deluxe  $75 

13.8.12  x  —  9,  (j)  —  K /3 

13.8.13  35,  -35 

13.8.14  maximum  e 4,  no  minimum 

13.8.15  5,  -9/2 

13.8.16  3,  3,  3 

13.8.17  a  cube  of  side  length  2/V3 

14.1.1  16 

14.1.2  4 


14.1.3  15/8 
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14.1.4  1/2 

14.1.5  5/6 

14.1.6  12  —  65/ (2e). 

14.1.7  1/2 

14.1.8  7T/64 

14.1.9  (2/9)23/2  —  (2/9) 

14.1.10  (1  —  cos(l))/4 

14.1.11  (2y/2-  l)/6 

14.1.12  n  —  2 

14.1.13  (c9  —  l)/6 

4  n 

14114  15' "4 

14.1.15  1/3 

14.1.16  448 

14.1.17  4/5 

14.1.18  8tt 

14.1.19  2 

14.1.20  5/3 

14.1.21  81/2 

14.1.22  2a3  / 3 

14.1.23  4k 

14.1.24  tt/32 

14.1.25  31/8 

14.1.26  128/15 
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14.1.27  1 8007T  m3 

( e 2  +  8e  +  16) 


14.1.28 


15 


\fe  +  ~4- 


5^5 


14.1.30  16  —  8\/2 


e5/2 

~15 


+ 


1 

15 


14.2.1  4  K 

14.2.2  32tt / 3  —  4\/37r 

14.2.3  (2-s/2)k/3 

14.2.4  4/9 

14.2.5  5^/3 

14.2.6  7t/6 

14.2.7  k/2 

14.2.8  tt/2  —  1 

14.2.9  v/3/4  +  tt/6 

14.2.10  8  +  7T 


14.2.11  %f  12 

14.2.12  (1— cos(9))7t/2 

14.2.13  -a5/15 

14.2.14  12; T 


14.2.15  n 

14.2.16  16/3 

14.2.17  21 K 


3  2k 

14.3.1  x  =  y  =  2/2 

14.3.2  x  =  4/5,  y  =  8/15 
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14.3.3  x  =  0,  y  =  3k/  16 

14.3.4  x  =  0,;y  =  16/(15tt) 

14.3.5  x—  3 /2,y  —  9  /  A 

14.3.6  x=6/5,y=  12/5 


14.3.7  x=  14/27,  y  =  28/55 


14.3.8  (3/4, 2/5) 


14.3.9 


14.3.10  x  =  k/2,  y  —  k/ 8 

r2n  /-1+COS0 


14.3.11  M 

Mx  = 


Jo  Jo 

2  K  /-1+COS0 


JO  JO 
r  2k  r  1-j-cos  0 


My  — 


Jo  Jo 


(2  +  cos  G)rdrdG, 
sin  0(2  + cos  0)r2  drdO, 
cos0(2  +  cos0)r2<ir£/0. 


14.3.12  M 

Mx  = 


k/2  pcosd 

/  (r  +  l)rdrdG, 
J — k/2  Jo 

k/2  pcosO 

sin0(r+  1  )r2drd0, 


My  = 


-k/2  Jo 

(■k/2  p cos  6 


-k/2  Jo 


cos  0(r  +  1  )r2drd0. 


14.3.13  M 

Mx  = 


'■k/2  /-l+cosO 


—  k/2  J cos0 
k/2  /-1+COS0 


rdrdG + 


r'in/2  i- l+cos0 


My  = 


-  k/2  J  cos  0 
ck/2  p  1+cos0 

-  k/2  J  cos  0 


r sinGdrdO  + 


rl  cos  0  dr  dG  + 


I  k/2  Jo 

p'iK/2  /-1+COS0 

I  k/2  Jo 

c2k/2  p  1  -(-cos  0 


I  k/2  Jo 


14.4.1  naVh2  + , 


14.4.2  Kcr\J m2  +  1 


14.4.3  \/3/2 

14.4.4 


rdrdG , 
'sin0rfr<i0, 
2  cos  GdrdG. 


14.4.5  nVl/% 

14.4.6  7r/2  —  1 
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c/2^ a2  - \-b2  +  c 2 

14  A?  - 2^ - 

14.4.8  8V3?r/3 

14.5.1  11/24 

14.5.2  623/60 

14.5.3  — 3e2/4  +  2e  — 3/4 

14.5.4  1/20 

14.5.5  tt/48 

14.5.6  11/84 

14.5.7  151/60 

14.5.8  n 

14.5.10  ^ 

16 

14.5.11  32 

14.5.12  64/3 

14.5.13  x  =  y  =  0,  z  =  16/ 15 

14.5.14  jc  =  y  =  0,z=  1/3 

14.6.1  7T/12 

14.6.2  1-V2/2) 

14.6.3  5tt/4 

14.6.4  0 


14.6.5  5^/4 
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14.6.6  4/5 

14.6.7  2567T/15 

14.6.8  4k2 

14.6.9  % 

14.6.10  nkh2a2/n 

14.6.11  Kkha2/ 6 

14.6.12  n2  / 4 

14.6.13  47T/5 

14.6.14  15k 

14.6.15  9^(5v/2-2v/5)/20 

14.7.1  4ttv/3/3 

14.7.2  0 


14.7.3  2/3 


14.7.4 


c2-l 

2e2 


14.7.5  36 


14.7.6  32(v/2  +  ln(l  +  v/2))/3 

14.7.7  3cos(l)  —  3cos(4) 

14.7.8  7r(l  — cos(l))/24 
14.7.10  (4/3)7 labc 

15.1.5  (3cost,3sinr,2  — 3sint) 


15.1.6  (0  ,tcost,tsint) 
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15.2.2  (— sint,2cos2r,2t),  r'/y4in2t  +  4cos2(2r)  +4 12 

15.2.3  (  — ef  sin(ef),er cos(et),cost),  r'/ \/ e2t  +  cos2 1 

15.2.4  (a/2/2,  a/2/2,  It/ 4)  +  f  (-a/2/ 2,  a/2/2, 1) 

15.2.5  ( 1/2,  a/3/2,- 1/2) +f(- a/3/2, 1/2,2a/3) 

15.2.6  2/v/5/\/4  +  7T2 

15.2.7  7v/5v/17r/85,  -9v/5v/l7/85 

15.2.9  (Oncost, tsint),  (O.cost  — tsint,sint  +  tcost),  r'/vTT/2,  \/l  +t2 

15.2.10  (sin?,  1 —cost, t2/2) 

15.2.11  t  =  4 

15.2.12  37,  1 

15.2.13  (?2/2,?3/3,sin?) 

15.2.16  (1, 1, 1)  when  t  —l  and  s  —  0\B  —  arccos(3/\/T4);  no 

15.2.17  — 6x+  (>’  —  k)  =  0 

15.2.18  -x/V2  +  y/V2  +  6z  =  0 

15.2.19  (-1,-3, 1) 

15.2.20  (l/V2,l/V2,0)+f(-l,l,6A/2) 

15.3.1  2n\/l3 

15.3.2  (—8+  13v/l3)/27 

15.3.3  v/5/2  +  ln(v/5  +  2)/4 

15.3.4  (85a/85-13a/T3)/27 

15.3.5  /0Vl  +e2tdt 

15.4.1  2  a/2/ (2  +  4t2)3/2 

15.4.2  2a/2/(1  +  8r2)3/2 
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15.4.3  V3600f 10  +  400/6  +  36/2/(l  +  9 t4  +  25 z8)3/2 

15.4.4  12\/l7/289 

15.5.1  (5/4,4/,  1),  (20/3,4,0),  aT  =  100/7  +  16//V25/8  +  I6t2,  aN  =  V3600/8  +  400/6  -  16/V25/8  +  16/2 

15.5.2  (—sin/, cost, It),  (—cost,  —  sin/,2),  4//a/4/2  +  1,  \/4/2  +  5/a/4/2  +  1 

15.5.3  (—  sin/, cos/, er),  (— cos/,  — sin/,ef),  e2t  j  \J  e2i  +  1 ,  a/ 2e2/  +  1  / vV2f  +  1 

15.5.4  (er  ,cost,er),  (ef,  —  sin/,er),  (2e2r  —  cos/sin/)/v/2e2'~+cos27,  a/2^ |  cos  /  +  sin  / 1  /  \/ 2e2?  +  cos2 / 

15.5.5  (—3  sin/, 2 cos /, 0) ,  (3 cos /, 2 sin/, 0) 

15.5.6  (— 3sin/,2cos/4-0.1,0),  (3cos/,2sin/  +  //10,0) 

15.5.7  (— 3sin/,2cos/,  1),  (3cos/,2sin/,/) 

15.5.8  (— 3sin/,2cos/+  1/10, 1),  (3cos/,2sin/  +  //10,/) 

16.2.1  -1,0 

16.2.2  0,  a  +  b 

16.2.3  (2b  —  a)/3,  0 

16.2.4  0,  1 

16.2.5  —In,  0 

16.2.6  0,  In 

16.3.1  13VTT/4 

16.3.2  0 

16.3.3  3sin(4)/2 

16.3.4  2e3 

16.3.5  128 

16.3.6  (9<?  — 3)/2 

16.3.7  ee+1  -  -  el!e~l  +  ex!e  +  e4/4  -  e~4/4 
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16.3.8  1 +  sin(l)  —  cos(l) 

16.3.9  31n3  —  21n2 

16.3.10  3/20+  101n(2)/7 

16.3.11  21n5-21n2+ 15/32 

16.3.12  1 

16.3.13  0 

16.3.14  21  +cos(l)  — cos(8) 

16.3.15  (ln29  — ln2)/2 

16.3.16  2  In  2  +  7T/4  —  2 

16.3.17  1243/3 

16.3.18  ln2+  11/3 

16.3.19  3  cos(l)  —  cos(2)  —  cos(4)  —  cos(8) 

16.3.20  -10/3 

16.3.22  no  / 

16.3.23  x4/4-//5 

16.3.24  no  / 

16.3.25  no  / 

16.3.26  _ysinx 

16.3.27  no  / 

16.3.28  xyz 

16.3.29  414 

16.3.30  6 

16.3.31  \/e  —  sin  3 
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16.3.32  \/y/il-\/y/3 

16.4.1  1 

16.4.2  0 

16.4.3  l/(2e)-\/(2e1)  +  e/2-e1  /2 

16.4.4  1/2 

16.4.5  -1/6 

16.4.6  (2v/3-10v/5  +  8v/6)/3- 2^/5 +  1/5 

16.4.7  11/2  — ln(2) 

16.4.8  2-71/2 

16.4.9  -17/12 

16.4.10  0 

16.4.11  -n/2 

16.4.12  127T 

16.4.13  2cos(l)  —  2sin(l)  —  1 

16.5.1  both  are —45^/4 

16.5.2  a2bc  +  ab2c  +  abc2 

16.5.3  e2  -  2e-\-l /2 

16.5.4  3 

16.5.5  384tt/5 

16.5.6  7t/3 

16.5.7  10tt 

16.5.8  n/2 
16.6.2  25VTi/A 
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16.6.3  nVTi 

16.6.4  tt(5\/5  —  l)/6 

16.6.5  4kV2 

16.6.6  na1/ 2 

16.6.7  In  a  (a  —  Va2  —  b2) 

16.6.8  7r((l  +4a2)3/2  —  l)/6 

16.6.9  —  2r/2 

16.6.10  7la2Vl  +  k2/4 

16.6.11  A\/l  +a2  +  62 

16.6.12  AVk2  + 1 

16.6.13  8a2 

16.7.1  (0,0, 3/8) 

16.7.2  (11/20,11/20,3/10) 

16.7.3  (0,0,1364/425) 

16.7.4  on  center  axis,  h/3  above  the  base 

16.7.5  16 

16.7.6  7 

16.7.7  -K 

16.7.8  -2/e 

16.7.9  Kb2(-4b4-3b2  +  6a2b2  +  6a2)/6 

16.7.10  9280  kg/s 

16.7.11  24e0 

16.8.1  —3k 

16.8.2  0 

16.8.3  —4k 

16.8.4  3k 

16.8.5  A(p(c  —  b)  +  q(a  —  c)+a  —  b ) 


Index 


absolute  extrema,  174,  210 
absolute  maximum,  174 
absolute  minimum,  174 
multivariate,  483 
absolute  value,  14 
properties,  14 
acceleration  vector,  542 
normal  component,  542 
tangential  component,  542 
antiderivative,  238 
arc  length  parameterization,  537 
asymptote 

horizontal,  206 
vertical,  205 

Cartesian  coordinates,  405,  450 
center  of  mass,  507 
central  angle,  33 
centroid,  509 
chain  rule,  137 

multivariate,  47 1 
characteristic  polynomial,  394 
Clairaut’s  theorem,  479 
comparison  theorem,  360 
completing  the  square,  6 
conic  section,  419 
conics 

circle,  26 
ellipse,  26 

horizontal  hyperbola,  27 
horizontal  parabola,  26 
standard  form,  25 
vertical  hyperbola,  27 
vertical  parabola,  25 
critical  point,  171 
cross  product,  440 
properties,  442 
curl,  547 

curvature,  539,  540 
cylindrical  coordinates,  451 
convert  to  rectangular,  452 
triple  integral,  515 


definite  integral,  241 
properties,  242,  243 
del  (V),  474 

derivative,  117,  121,  130 
acceleration,  128 
chain  rule,  137 
differentiable,  125 
exponential  function,  143,  145 
implicit  differentiation,  148 
intervals  of  increase  and  decrease,  200 
leibniz  notation,  122 
logarithmic  differentiation,  152 
logarithmic  function,  145 
nth  derivative,  127 
of  the  inverse,  156 
rate  of  change,  163 
rules,  130 

second  derivative,  127 
trigonometric  functions,  136 
velocity,  119,  128 
difference  quotient,  115 
differentiable,  469 
differential  equation,  250,  379 
Euler’s  method,  390 
first  order,  379 
general  solution,  382 
initial  value  problem,  251,  380,  385,  387 
integrating  factor,  388 
method  of  undetermined  coefficients,  397 
particular  solution,  382 
second  order,  393 
second  order  linear,  397 
separable,  381 
separation  of  variables,  382 
variation  of  parameters,  387 
differentials,  186 
direction  vector,  446 
directional  derivative,  475 
discriminant,  481 
distance  formula,  24,  426 
divergence,  547 
theorem,  562 
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dot  product,  434 
properties,  438 
double  integral,  494 

elementary  function,  49 
Euler’s  method,  390 
extreme  value  theorem,  174 

factorial  notation,  188 

Fermat’s  theorem,  170 

first  order  differential  equation,  379 

first  order  homogeneous  linear  equation,  384 

first  order  initial  value  problem,  380 

first  order  linear  differential  equation,  387 

flux,  571 

function,  45 

composition,  52 
continuous,  102,  291 
dependent  variable,  47 
discontinuous,  104 
domain,  45 
elementary,  49 
exponential  function,  54 
independent  variable,  47 
infinite  discontinuity,  105 
inverse,  57 

jump  discontinuity,  105 
left  continuous,  107 
linear,  45 

logarithmic  function,  60 
natural  logarithm,  61 
of  two  variables,  463 
one-to-one,  58 
removable  discontinuity,  105 
right  continuous,  107 
same  derivative,  182 
symmetry,  206 
transcendental,  49 

fundamental  theorem  of  calculus,  238,  291 
and  chain  rule,  246 

Gauss’  Theorem,  563 
global  extrema,  174,  210 
gradient,  474,  545,  554 
greatest  integer,  173 
Green’s  theorem,  557 


hyperbolic  cosine,  69 
hyperbolic  sine,  69 

implicit  differentiation,  148,472 
improper  integral,  292,  293 
comparison  test,  298 
converges,  292 
diverges,  292 
indefinite  integral,  241 
indeterminate  form,  194 
inequality  notation,  8 
inflection  points,  204 
integral  sign,  238 
integral  test,  353 
integrating  factor,  388 
integration,  248 

antiderivative,  238 
area,  221 
by  parts,  278 
change  of  variables,  519 
common  mistakes,  248 
definite 

Riemann  Sums,  236 
displacement,  219 

error  for  trapezoid  approximation,  287 
over  a  curve,  549 

products  of  secant  and  tangent,  266 
products  of  sine  and  cosine,  260 
rules,  249 

substitution  rule,  253,  254,  258 
trapezoid  approximation,  286 
trigonometric  substitution,  272 
intermediate  value  theorem,  111,  190 
intersection,  3 
interval,  9 

closed  interval,  9 
open  interval,  9 
interval  of  convergence,  367 
irrational,  4 

Jacobian,  522 

L’Hopital’s  rule,  194 
Lagrange  multipliers,  486 
Left  Hand  Rule,  223,  228 
level  curve,  459 
limit,  75,  77 


horizontal  asymptote,  90 
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at  infinity,  87 
indeterminate  form,  194 
infinite,  88 
left-hand,  76 

multivariate  function,  462 
one-sided,  76,  80 
right-hand,  76 
line,  19 

direction  vector,  446 
general  form,  21 
parametric  equations,  447 
point-slope  form,  20 
slope,  19 

slope-intercept  form,  20 
vector  equation,  446 
y-intercept,  20 
line  integral,  549 
closed  paths,  555 
fundamental  theorem,  554 
linear  approximation,  184 
local  extrema,  169 
multivariate,  48 1 
local  maximum,  169,  199,  201 
local  minimum,  169,  199,  201 
logarithmic  differentation,  152 

Maclaurin  series,  371 

mean  value  theorem,  178,  180 

method  of  undetermined  coefficients,  397 

midpoint,  24 

Midpoint  Rule,  223,  228 

mixed  partials,  479 

moment,  507 

Newton’s  method,  190,  193 
normal  vector,  443 
number  systems,  3 

complex  numbers,  4 
integers,  4 
natural  numbers,  4 
rational  numbers,  4 
real  numbers,  4 

orientable  surface,  572 

p-series,  353 
parameter,  415 


parametric  equations,  415,  447,  526 
partial  fraction  decomposition,  284 
plane 

normal  vector,  443 
standard  form,  445 
polar  coordinates,  405,  450 
conic  section,  419 
convert  to  rectangular,  406,  450 
double  integral,  502 
parametric  equations,  415 
polynomial 

of  two  variables,  463 
power  series,  366,  368 
centered  at  c,  367 

quadratic  formula,  7,  282 

radians,  32 

radius  of  convergence,  367 
rational  function,  281 
rationalizing,  85 
rectangular  coordinates,  405 
Riemann  Sum,  222,  227,  230,  236 
Right  Hand  Rule,  223,  228 
rolle’s  theorem,  179 

second  derivative 
concavity,  203 
inflection  points,  204 
second  derivative  test,  201 
second  order  differential  equation,  393 
second  order  homogeneous  equation,  393 
solutions,  394 

second  order  linear  equation,  397,  401 
steady  state  solution,  399 
transient  solution,  399 
variation  of  parameters,  402 
sequence,  340 
bounded,  345 
converves,  340 
decreasing,  344 
diverges,  340 
geometric  sequence,  344 
increasing,  344 
limit,  341 
monotonic,  344 
non-decreasing,  344 
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non-increasing,  344 
properties,  342 
sequence  of  partial  sums,  347 
series,  339 

absolute  convergence,  361 
alternating  harmonic,  356 
conditional  convergence,  361 
convergent,  347 
divergent,  347 
geometric,  347 
harmonic,  350 
ratio  test,  364 
root  test,  364 
set,  3 

sigma  notation,  225,  237 
Simpson’s  rule,  289 
slant  asymptote,  91 
slope  field,  391 
spherical  coordinates,  453 
convert  to  rectangular,  453 
triple  integral,  515 
squeeze  theorem,  98 
Stokes’  theorem,  574 
subset,  4 
summation 

notation,  225 
properties,  226 

tangent  line,  116 
tangent  plane,  469 
Taylor  polynomials,  188 
Taylor  series,  373,  374 
transcendental  function,  49 
transformations,  50 
triangle  inequality,  14 
trigonometry 
arccosine,  65 
arcsine,  63 
arctangent,  66 
CAST  rule,  36,  37 
cosecant,  33 
cosine,  33 
cotangent,  34 
identities,  40 
secant,  34 
sine,  33 


special  triangles,  36 
tangent,  33 
triple  integral,  511 

union,  3 
unit  circle,  35 
unit  vector,  436 

variation  of  parameters,  387 
vector,  429 

angle  between,  433 
anti-parallel,  435 
parallel,  435 
perpendicular,  435 
projection,  435 
scalar  multiplication,  430 
scalar  projection,  436 
sum,  429 
vector  field,  545 
conservative,  554 
curl,  547 
divergence,  547 
vector  function,  526,  528 
curvature,  539,  540 
derivative,  529 
derivative  properties,  532 
standard  unit  tangent  vector,  533 
unit  binormal,  541 
unit  normal,  541 
vertical  asymptote,  89 

work,  322 
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